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(54) A processor for executing Instructions from memory according to a program counter, and 
a compiler, an assembler, a linker and a debugger for such a processor 



(57) When a branch instruction is decoded by the 
instruction decoders 409a ~ 409c, the upper 29 bits of 
the PC relative value included in the branch instruction 
are sent to the upper PC calculator 41 1 and the lower 3 
bits are sent to the lower PC calculator 405. The lower 
PC calculator 405 adds the lower 3 bits of the PC relative 



value and the lower 3 bits of the present lower PC 404 
and sends the result to the lower PC 404 as the updated 
lower PC. The upper PC calculator 411 adds the upper 
29 bits of the PC relative value, the upper 29 bits of the 
present upper PC 403, and a carry that may be received 
from the lower PC calculator 405, and sends the result 
to the upper PC 403 as the updated upper PC. 
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Description 

[0001] This application is based on an application No. 
H1 0-1 18326 filed in Japan, the content of which is here- 
by incorporated by reference. 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0002] The present invention relates to a processor 
for executing instructions in units that are unrelated to 
the units in which instructions are read, and a compiler, 
an optimization apparatus, an assembler, a linker, a de- 
bugger and a disassembler for such processor. 

2 Description of the Prior Art 

[0003] Processors conventionally read and execute 
instructions stored in memory according to a program 
counter. Fig. 1 is a block diagram showing the basic con- 
struction of an example processor. 
[0004] The instruction memory 4301 stores four 8-bit 
instructions as one instruction packet. 
[0005] The program counter 4300 indicates the ad- 
dress of an instruction packet in the instruction memory 
4301. 

[0006] The instruction reading unit 4302 reads the in- 
struction packet indicated by the program counter 4300 
from the instruction memory 4301 . 
[0007] The instruction executing unit 4303 executes 
all four instructions included in the read instruction pack- 
et in one cycle. 

[0008] In this way, a conventional processor can read 
an instruction packet that is indicated by the program 
counter and can execute four instructions in the instruc- 
tion packet. 

[0009] The above processor has to execute all of the 
instructions in the read instruction packet in one cycle. 
Accordingly, when one or more instructions in an in- 
struction packet cannot be executed due to problems 
with computer system resources such as memory or I/ 
O, none of the instructions in the instruction packet can 
be executed until such problems are resolved. This 
slows program execution. 

SUMMARY OF THE INVENTION 

[0010] In view of the stated problems, it is a primary 
object of the present invention to provide a processor 
that executes instructions in units that are unrelated to 
the units in which instructions are read from a program 
and a program development environment for generating 
suitable programs. 

[0011] This primary object is achieved by a processor 
for reading instructions from a memory according to a 
program counter, the memory storing instructions in 
one-byte units, and for executing the read instructions, 



2 

the program counter including a first program counter 
and a second program counter, the first program counter 
indicating a storage position of a processing packet in 
the memory, the processing packet being composed of 
5 an integer number of the one-byte units, the second pro- 
gram counter indicating a position of processing target 
instruction in the processing packet, the processing tar- 
get instruction being an operation to be executed by the 
processor 

w [0012] With the stated construction, the first program 
counter indicates a storage position in the memory of a 
processing packet whose size is an integer number of 
bytes. Reads from the memory are performed based on 
this first program counter. The second program counter 

15 can indicate any position of a processing target instruc- 
tion included in the processing packet read from the 
memory. As a result, the instruction(s) to be executed 
can be freely set regardless of the amount of data read 
in one read operation. This means that instructions 

20 whose word length is not an integer number of bytes can 
be executed even when read operations from the mem- 
ory to the processor are performed in units of an integer 
number of bytes. 

[0013] Here, the processor may include a first pro- 
25 gram counter updating unit and a second program coun- 
ter updating unit, the second program counter updating 
unit incrementing a value of the second program counter 
in accordance with an amount of instructions that were 
executed in a preceding cycle and sending any carry 
30 generated in an incrementing to the first program coun- 
ter updating unit, and the first program counter updating 
unit adding the carry received from the second program 
counter updating unit to the value of the first program 
counter. 

35 [0014] With the stated construction, the value of the 
program counter is incremented by the amount of in- 
structions that have just been executed, so that the pro- 
gram counter can be updated to indicate the first posi- 
tion of the instructions to be executed in the next cycle. 
40 [0015] Here, the processor may further include: a pro- 
gram counter relative value extracting unit for extracting, 
when an instruction being executed includes a program 
counter relative value that is based on an address of a 
first instruction executed in a present cycle, the program 
45 counter relative value; and a calculating unit for adding 
the program counter relative value to the value of the 
first program counter and the value of the second pro- 
gram counter, and setting an addition result as the value 
of the first program counter and the value of the second 
so program counter. 

[0016] When the processor executes a branch in- 
struction, the value of the program counter is added to 
a program counter relative value that is a difference in 
addresses between the present branch instruction and 
55 the branch destination instruction. The result of this ad- 
dition is then set as the new value of the program coun- 
ter to have the program counter indicate the branch des- 
tination instruction. 
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[0017] Here, the calculating unit may include a first 
calculating unit and a second calculating unit, the sec- 
ond calculating unit adding the value of the second pro- 
gram counter and lower bits of the program counter rel- 
ative value, setting a result of an addition as the value 
of the second program counter, and sending any carry 
generated in the addition to the first calculating unit, and 
the first calculating unit adding the value of the first pro- 
gram counter, upper bits of the program counter relative 
value, and any carry received from the second calculat- 
ing unit, and setting a result of an addition as the value 
of the first program counter. 

[0018] When the processor executes a branch in- 
struction and the program counter and a program coun- 
ter relative value are added, a carry generated when cal- 
culating the lower bits is properly considered when cal- 
culating the upper bits. In this way, addresses can be 
calculated with proper continuity between the calcula- 
tion of the lower bits and the calculation of the upper bits. 
[0019] Here, the calculating unit may include a first 
calculating unit and a second calculating unit, the sec- 
ond calculating unit adding the value of the second pro- 
gram counter and lower bits of the program counter rel- 
ative value without generating a carry, and setting a re- 
sult of an addition as the value of the second program 
counter, the first calculating unit adding the value of the 
first program counter and upper bits of the program 
counter relative value, and setting a result of an addition 
as the value of the first program counter. 
[0020] When the processor executes a branch in- 
struction, calculation of the lower bits of the value of the 
program counter and the program counter relative value 
by the second calculating unit does not generate a carry 
to the calculation of the upper bits of the value of the 
program counter and the program counter relative value 
by the first calculating unit. As a result, the calculations 
of the first and second calculators can be performed in- 
dependently of one another, so that a simplified hard- 
ware construction can be used. 
[0021] Here, the calculating unit may add the value of 
the first program counter and upper bits of the program 
counter relative value, sets a result of an addition as the 
value of the first program counter, and sets lower bits of 
the program counter relative value as the value of the 
second program counter. 

[0022] When the processor executes a branch in- 
struction, no calculation using the value of the second 
program counter and the lower bits of the program coun- 
ter relative value is required, so that the processor can 
execute branch instructions at a higher speed. 
[0023] Here, the calculating unit may add the program 
counter relative value and a value whose upper bits are 
the value of the first program counter and lower bits are 
the value of the second program counter, and sets upper 
bits of a result of an addition as the value of the first 
program counter and lower bits of the result as the sec- 
ond program counter. 

[0024] When the processor executes a branch in- 



struction, the calculation using the value of the program 
counter and the program counter relative value can be 
performed by a standard calculator. This means the 
hardware construction of the processor can be simpli- 
5 fied. 

[0025] Here, the processor may further include: a pro- 
gram counter relative value extracting unit for extracting, 
when an executed instruction includes a program coun- 
ter relative value that is based on an address of the ex- 

10 ecuted instruction, the program counter relative value; 
a program counter amending unit for amending the val- 
ue of the first program counter and the value of the sec- 
ond program counter to indicate an address of the exe- 
cuted instruction; and a calculating unit for adding the 

15 program counter relative value, the value of the first pro- 
gram counter, and the value of the second program 
counter, and setting a result of an addition as the value 
of the first program counter and the value of the second 
program counter. 

20 [0026] The program counter relative value is the dif- 
ference in addresses between a branch instruction and 
the branch destination instruction, so that it will not be 
necessary to change the program counter relative value 
even when there is a change in the boundaries marking 

25 which instructions in the program will be executed in par- 
allel. 

[0027] Here, the processor may further include: a pro- 
gram counter relative value calculating instruction de- 
coding unit for decoding a program counter relative val- 
30 ue calculating instruction that performs an addition us- 
ing a program counter relative value and one of (a) a 
value of the program counter stored in a register, and 
(b) the value of the first program counter and the value 
of the second program counter; a calculating unit for per- 
35 forming the addition indicated by the program counter 
relative value calculating instruction to generate an ad- 
dition result; and a program counter value updating unit 
for storing the addition result in one of (a) the register, 
and (b) the first program counter and the second pro- 
40 gram counter. 

[0028] With the stated construction, it is possible to 
use an instruction that indicates a calculation using the 
value of the program counter and a program counter rel- 
ative value in place of an instruction that stores the ab- 
45 solute address of a function into a register. A program 
counter relative value has a shorter bit width that the 
absolute address of an instruction, so that the overall 
code size can be reduced. When using PIC codes where 
the addresses of instructions in memory are only deter- 
so mined when the program is executed, absolute address- 
es cannot be used, so that calculation instructions that 
use the program counter and a program counter relative 
value are essential. 

[0029] Here, the first program counter may indicate a 
55 memory address, the memory address being a storage 
position in the memory of a processing packet that is 
given by bit shifting the value in the first program counter 
by log 2 n bits in a leftward direction, n being a length of 
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a processing packet in bytes. 

[0030] With the stated construction, while separate 
addresses are assigned to each one-byte storage pack- 
et in the memory, the value of the first program counter 
corresponds with the address of a processing packet in 
the memory. As a result, the processor can easily spec- 
ify a processing packet in the memory. 
[0031] Here, the processor may further include: an in- 
struction buffer for temporarily storing instructions; and 
an instruction reading unit for transferring instructions 
with a minimum transfer size of one one-byte unit from 
the memory to the instruction buffer, in accordance with 
available space in the instruction buffer but regardless 
of a size of a processing packet. 
[0032] With the stated construction, the amount of da- 
ta read by the processor from the memory in one read 
operation can be freely set, so that the construction in 
the processor for reading instructions can be made high- 
ly flexible. 

[0033] The stated primary object can also be 
achieved by an instruction sequence optimizing appa- 
ratus, for generating optimized code from an instruction 
sequence, including: an address assigning unit for esti- 
mating a size of each instruction in the instruction se- 
quence and assigning an address to each instruction, 
upper bits of each address indicating a memory address 
at which a processing packet is stored and lower bits of 
each address indicating a processing target instruction 
in the processing packet; a label detecting unit (1) for 
detecting a label, which should be resolved by an ad- 
dress of a specified instruction, from the instruction se- 
quence, and obtaining the address of the specified in- 
struction, and (2) for detecting a label, which should be 
resolved by a difference in addresses of two specified 
instructions, from the instruction sequence, and obtain- 
ing the addresses of the two specified instructions; a 
program counter relative value calculating unit for cal- 
culating, when a label which should be resolved by a 
difference in addresses of two specified instructions has 
been detected, a program counter relative value by sub- 
tracting an address of one of the two specified instruc- 
tions from an address of another of the two specified 
instructions; a converting unit (1) for converting an in- 
struction that has a label that should be resolved by an 
address of a specified instruction into an instruction with 
a size that is based on a size of the address of the spec- 
ified instruction, (2) for converting an instruction that has 
a label that should be resolved by a difference in ad- 
dresses of two specified instructions into an instruction 
with a size that is based on a size of the program counter 
relative value calculated from the addresses of the two 
specified instructions; and an optimized code generat- 
ing unit for generating optimized code by converting ad- 
dresses of instructions in accordance with the sizes of 
instructions after conversion by the converting unit. 
[0034] The above construction achieves an optimiza- 
tion apparatus for generating programs for a processor 
that executes branch instructions. 



[0035] Here, the program counter relative value cal- 
culating unit may include a lower bit subtracting unit and 
an upper bit subtracting unit, the lower bit subtracting 
unit subtracting lower bits of the address of the one of 
5 the two specified instructions from lower bits of the ad- 
dress of the other of the two specified instructions, for 
setting a result of a subtraction as lower bits of the pro- 
gram counter relative value, and sending any carry gen- 
erated in the subtraction to the upper bit subtracting unit, 
io and the upper bit subtracting unit subtracting upper bits 
of the address of one of the two specified instructions 
and any carry received from the lower bit subtracting unit 
from upper bits of the address of the other of the two 
specified instructions, and for setting a result of a sub- 
's traction as upper bits of the program counter relative val- 
ue. 

[0036] The above construction achieves an optimiza- 
tion apparatus for generating programs for a processor 
which, when executing a branch instruction, calculates 
20 the address of a branch destination instruction using a 
carry method. 

[0037] Here, the program counter relative value cal- 
culating unit may include a lower bit subtracting unit and 
an upper bit subtracting unit, the lower bit subtracting 

25 unit subtracting lower bits of the address of one of the 
two specified instructions from lower bits of the address 
of the other of the two specified instructions without gen- 
erating a carry and setting a result of a subtraction as 
lower bits of the program counter relative value, and the 

30 upper bit subtracting unit subtracting upper bits of the 
address of one of the two specified instructions from up- 
per bits of the address of the other of the two specified 
instructions, and for setting a result of a subtraction as 
upper bits of the program counter relative value. 

35 [0038] The above construction achieves an optimiza- 
tion apparatus for generating programs for a processor 
which, when executing a branch instruction, calculates 
the address of a branch destination instruction without 
using a carry. 

40 [0039] Here, the program counter relative value cal- 
culating unit may subtract upper bits of an address of 
one of the two specified instructions from upper bits of 
an address of the other of the two specified instructions, 
set a result of a subtraction as upper bits of the program 

45 counter relative value, and set lower bits of the other of 
the two specified instructions as lower bits of the pro- 
gram counter relative value. 

[0040] The above construction achieves an optimiza- 
tion apparatus for generating programs for a processor 
so which, when executing a branch instruction, calculates 
the address of a branch destination instruction using an 
absolute value. 

[0041] The stated primary object can also be 
achieved by an assembler that generates relocatable 
55 code from an instruction sequence, each address of an 
instruction in the instruction sequence having upper bits 
that indicate a memory address at which a processing 
packet is stored and lower bits that indicate a position 
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of processing target instruction that is included in the 
processing packet, the assembler including: a label de- 
tecting unit for detecting a label in the instruction se- 
quence that should be resolved by a difference in ad- 
dresses between two specified instructions, and obtain- 
ing the addresses of the two specified instructions; a 
program counter relative value calculating unit for cal- 
culating a program counter relative value by subtracting 
an address of one of the two specified instructions from 
an address of another of the two specified instructions; 
and a replacing unit for replacing the label with the pro- 
gram counter relative value calculated by the program 
counter relative value calculating unit. 
[0042] The above construction achieves an assem- 
bler for generating programs for a processor that exe- 
cutes branch instructions. 

[0043] Here, the program counter relative value cal- 
culating unit may include a lower bit subtracting unit and 
an upper bit subtracting unit, the lower bit subtracting 
unit subtracting lower bits of the address of the one of 
the two specified instructions from lower bits of the ad- 
dress of the other of the two specified instructions, for 
setting a result of a subtraction as lower bits of the pro- 
gram counter relative value, and sending any carry gen- 
erated in the subtraction to the upper bit subtracting unit, 
and the upper bit subtracting unit subtracting upper bits 
of the address of one of the two specified instructions 
and any carry received from the lower bit subtracting unit 
from upper bits of the address of the other of the two 
specified instructions, and for setting a result of a sub- 
traction as upper bits of the program counter relative val- 
ue. 

[0044] The above construction achieves an assem- 
bler for generating programs for a processor which, 
when executing a branch instruction, calculates the ad- 
dress of a branch destination instruction using a carry 
method. 

[0045] Here, the program counter relative value cal- 
culating unit may include a lower bit subtracting unit and 
an upper bit subtracting unit, the lower bit subtracting 
unit subtracting lower bits of the address of one of the 
two specified instructions from lower bits of the address 
of the other of the two specified instructions without gen- 
erating a carry and setting a result of a subtraction as 
lower bits of the program counter relative value, and the 
upper bit subtracting unit subtracting upper bits of the 
address of one of the two specified instructions from up- 
per bits of the address of the other of the two specified 
instructions, and for setting a result of a subtraction as 
upper bits of the program counter relative value. 
[0046] The above construction achieves an assem- 
bler for generating programs for a processor which, 
when executing a branch instruction, calculates the ad- 
dress of a branch destination instruction without using 
a carry. 

[0047] Here, the program counter relative value cal- 
culating unit may subtract upper bits of an address of 
one of the two specified instructions from upper bits of 



an address of the other of the two specified instructions, 
set a result of a subtraction as upper bits of the program 
counter relative value, and set lower bits of the other of 
the two specified instructions as lower bits of the pro- 

5 gram counter relative value. 

[0048] The above construction achieves an optimiza- 
tion apparatus for generating programs for a processor 
which, when executing a branch instruction, calculates 
the address of a branch destination instruction using an 

to absolute value. 

[0049] The stated primary object can also be 
achieved by a linker that generates object code by com- 
bining relocatable code, each address of an instruction 
in the relocatable code having upper bits that indicate a 

15 memory address at which a processing packet is stored 
and lower bits that indicate a position of processing tar- 
get instruction that is included in the processing packet, 
the linker including: a relocation information detecting 
unit for detecting a label in the relocatable code that 

20 should be resolved by a difference in addresses be- 
tween two specified instructions, and obtaining the ad- 
dresses of the two specified instructions; a program 
counter relative value calculating unit for calculating a 
program counter relative value by subtracting an ad- 

25 dress of one of the two specified instructions from an 
address of another of the two specified instructions; and 
a replacing unit for replacing the label with the program 
counter relative value calculated by the program counter 
relative value calculating unit. 

30 [0050] The above construction achieves a linker for 
generating programs for a processor that executes 
branch instructions. 

[0051] Here, the program counter relative value cal- 
culating unit may include a lower bit subtracting unit and 

35 an upper bit subtracting unit, the lower bit subtracting 
unit subtracting lower bits of the address of the one of 
the two specified instructions from lower bits of the ad- 
dress of the other of the two specified instructions, for 
setting a result of a subtraction as lower bits of the pro- 

40 gram counter relative value, and sending any carry gen- 
erated in the subtraction to the upper bit subtracting unit, 
and the upper bit subtracting unit subtracting upper bits 
of the address of one of the two specified instructions 
and any carry received from the lower bit subtracting unit 

45 from upper bits of the address of the other of the two 
specified instructions, and for setting a result of a sub- 
traction as upper bits of the program counter relative val- 
ue. 

[0052] The above construction achieves a linker for 
so generating programs for a processor which, when exe- 
cuting a branch instruction, calculates the address of a 
branch destination instruction using a carry method. 
[0053] Here, the program counter relative value cal- 
culating unit may include a lower bit subtracting unit and 
55 an upper bit subtracting unit, the lower bit subtracting 
unit subtracting lower bits of the address of one of the 
two specified instructions from lower bits of the address 
of the other of the two specified instructions without gen- 
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erating a carry and setting a result of a subtraction as 
lower bits of the program counter relative value, and the 
upper bit subtracting unit subtracting upper bits of the 
address of one of the two specified instructions from up- 
per bits of the address of the other of the two specified 
instructions, and for setting a result of a subtraction as 
upper bits of the program counter relative value. 
[0054] The above construction achieves a linker for 
generating programs for a processor which, when exe- 
cuting a branch instruction, calculates the address of a 
branch destination instruction without using a carry. 
[0055] Here, the program counter relative value cal- 
culating unit may subtract upper bits of an address of 
one of the two specified instructions from upper bits of 
an address of the other of the two specified instructions, 
set a result of a subtraction as upper bits of the program 
counter relative value, and set lower bits of the other of 
the two specified instructions as lower bits of the pro- 
gram counter relative value. 

[0056] The above construction achieves a linker for 
generating programs for a processor which, when exe- 
cuting a branch instruction, calculates the address of a 
branch destination instruction using an absolute value. 
[0057] The stated primary object can also be 
achieved by a disassembler that receives an indication 
of an address of an instruction in object code and out- 
puts an assembler name of the instruction at the indi- 
cated address, each address of an instruction in the ob- 
ject code having upper bits that indicate a memory ad- 
dress at which a processing packet is stored and lower 
bits that indicate a position of processing target instruc- 
tion that is included in the processing packet, the disas- 
sembler including: a program counter relative value ex- 
tracting unit for extracting, when the indicated instruc- 
tion includes a program counter relative value, the pro- 
gram counter relative value from the indicated instruc- 
tion; a label addressing calculating unit for adding an 
address of the indicated instruction to the extracted pro- 
gram counter relative value and setting an addition re- 
sult as a label address; a storing unit for storing a label 
name corresponding to each label address; and a 
searching unit for searching the storing unit for a label 
name that corresponds to the calculated label address 
and outputting the corresponding label name. 
[0058] The stated construction can disassemble a 
program that includes a branch instruction. When the 
disassembled instruction is a branch instruction, the ad- 
dress of the branch destination instruction can be cal- 
culated from the program counter relative value. This 
address is then used to search the label table and so 
obtain the label name. As a result, the branch destina- 
tion can be displayed to the user in the readily under- 
standable form of a label name, even when program 
counter relative values are used in branch instructions. 
[0059] Here, the label address calculating unit may in- 
clude a lower bit calculating unit and an upper bit calcu- 
lating unit, the lower bit calculating unit for adding lower 
bits of the address of the indicated instruction and lower 



bits of the program counter relative value, setting a re- 
sult of an addition as lower bits of a label address, and 
sending any carry generated by the addition to the upper 
bit calculating unit, and the upper bit calculating unit 

5 adding upper bits of the address of the indicated instruc- 
tion, upper bits of the program counter relative value, 
and any carry received from the lower bit calculating 
unit, and setting a result of the an addition as upper bits 
of the label address. 

w [0060] The above construction achieves a disassem- 
bler that can disassemble programs for a processor 
which, when executing a branch instruction, calculates 
an address of a branch destination instruction using a 
carry. 

15 [0061] Here, the label address calculating unit may in- 
clude a lower bit calculating unit and an upper bit calcu- 
lating unit, the lower bit calculating unit adding lower bits 
of the address of the indicated instruction and lower bits 
of the program counter relative value without generating 

20 a carry, and setting a result of an addition as lower bits 
of a label address, and the upper bit calculating unit add- 
ing upper bits of the address of the indicated instruction 
and upper bits of the program counter relative value, and 
setting a result of an addition as upper bits of the label 

25 address. 

[0062] The above construction achieves a disassem- 
bler that can disassemble programs for a processor 
which, when executing a branch instruction, calculates 
an address of a branch destination instruction without 

30 using a carry. 

[0063] Here, the label address calculating unit may 
add upper bits of the address of the indicated instruction 
and upper bits of the program counter relative value, set 
a result of an addition as upper bits of the label address, 

35 and set lower bits of the program counter relative value 
as lower bits of the label address. 
[0064] The above construction achieves a disassem- 
bler that can disassemble programs for a processor 
which, when executing a branch instruction, calculates 

40 an address of a branch destination instruction using an 
absolute value. 

[0065] The stated primary object can also be 
achieved by a debugger that receives an indication of 
an address of an instruction in object code and replaces 

45 the instruction at the indicated address with a replace- 
ment instruction, each address of an instruction in the 
object code having upper bits that indicate a memory 
address at which a processing packet is stored and low- 
er bits that indicate a position of processing target in- 

so struction that is included in the processing packet, the 
debugger including: a processing packet reading unitfor 
reading a processing packet that is indicated by upper 
bits of the indicated address from the memory and writ- 
ing the processing packet into an instruction buffer; an 

55 instruction writing unit for writing the replacement in- 
struction into the processing packet in the instruction 
buffer over an instruction that is indicated by the lower 
bits of the indicated address; and a processing packet 
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writing unit for writing the processing packet in the in- 
struction buffer back into the memory after the replace- 
ment instruction has been written. 
[0066] The above construction reads instructions in 
units of processing packets from a memory that stores 
instructions in one-byte storage packets, rewrites in- 
structions in an instruction buffer, and writes instructions 
back into the memory in units of processing packets. 
This achieves a debugger that can debuq instructions 
whose length is not an integer number of bytes. 
[0067] The stated primary object can also be 
achieved by a compiler that generates an instruction se- 
quence from source code, the compiler generating a 
program counter relative value calculating instruction 
that is executed by a processor, the program counter 
relative value calculating instruction being an instruction 
that performs a calculation using a first value and a pro- 
gram counter relative value and uses a result of the cal- 
culation to update the first value, the first value being 
one of (a) a value of a program counter stored in a reg- 
ister, and (b) the value stored in a program counter of 
the processor, wherein upper bits of the first value indi- 
cate a memory address at which a processing packet is 
stored, and lower bits of the first value of the program 
counter indicate a processing target instruction that is 
included in the processing packet. 
[0068] The above construction achieves a compiler 
that generates programs for a processor that executes 
program counter relative value calculating instructions. 
[0069] Here, the processor may include a lower bit 
calculating unit and an upper bit calculating unit, the pro- 
gram counter relative value calculating instruction hav- 
ing the lower bit calculating unit perform a lower bit cal- 
culation and the upper bit calculating unit perform an up- 
per bit calculation, the lower bit calculation being an ad- 
dition using lower bits of the first value and lower bits of 
the value of the program counter relative value, where 
a result of the lower bit calculation is set as the lower 
bits of the first value and any generated carry is sent to 
the upper bit calculating unit, and the upper bit calcula- 
tion being an addition using upper bits of the first value, 
upper bits of the value of the program counter relative 
value and any carry received from the lower bit calcu- 
lating unit, where a result of the upper bit calculation is 
set as the upper bits of the first value. 
[0070] The above construction achieves a compiler 
that generates a program for a processor which, when 
executing a program counter relative value calculating 
instruction, performs a calculation using a value of the 
program counter and the program counter relative value 
according to a carry method. 

[0071] Here, the processor may include a lower bit 
calculating unit and an upper bit calculating unit, the pro- 
gram counter relative value calculating instruction hav- 
ing the lower bit calculating unit perform a lower bit cal- 
culation and the upper bit calculating unit perform an up- 
per bit calculation, the lower bit calculation being an ad- 
dition using lower bits of the first value and lower bits of 
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the value of the program counter relative value that does 
not generate a carry, where a result of the lower bit cal- 
culation is set as the lower bits of the first value, and the 
upper bit calculation being a calculation using upper bits 

5 of the first value and upper bits of the value of the pro- 
gram counter relative value, where a result of the upper 
bit calculation is set as the upper bits of the first value. 
[0072] The above construction achieves a compiler 
that generates a program for a processor which, when 

10 executing a program counter relative value calculating 
instruction, performs a calculation using a value of the 
program counter and the program counter relative value 
without generating a carry. 

[0073] Here, the processor may includes an upper bit 
15 calculating unit, the program counter relative value cal- 
culating instruction having the upper bit calculating unit 
perform an upper bit calculation and setting lower bits 
of the program counter relative value as lower bits of the 
first value, and the upper bit calculation being an addi- 
20 tion using upper bits of the first value and upper bits of 
the value of the program counter relative value, where 
a result of the upper bit calculation is set as the upper 
bits of the first value. 

[0074] The above construction achieves a compiler 
25 that generates a program for a processor which, when 
executing a program counter relative value calculating 
instruction, performs a calculation using a value of the 
program counter and the program counter relative value 
according to an absolute value calculating method. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0075] These and other objects, advantages and fea- 
tures of the invention will become apparent from the tol- 
as lowing description thereof taken in conjunction with the 
accompanying drawings which illustrate a specific em- 
bodiment of the invention. In the drawings: 

Fig. 1 is a block diagram showing the construction 
40 of a conventional processor; 

Fig. 2A shows the format of one instruction execut- 
ed by the processor of the first embodiment of the 
present invention; 

Fig. 2B shows the format of another instruction ex- 
45 ecuted by the processor of the first embodiment of 
the present invention; 

Fig. 2C shows the format of another instruction ex- 
ecuted by the processor of the first embodiment of 
the present invention; 
so Fig. 2D shows the format of another instruction ex- 
ecuted by the processor of the first embodiment of 
the present invention; 

Fig. 2E shows the format of another instruction ex- 
ecuted by the processor of the first embodiment of 
55 the present invention; 

Fig. 3A shows an instruction packet that is the unit 
used for storing and reading instructions in this first 
embodiment; 
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Fig. 3B shows the read order of instructions; 
Fig. 3C shows the execution order of instructions; 
Fig. 4 shows an example of the methods used by a 
conventional processor to store and read instruc- 
tions that are not byte-aligned; 5 
Fig. 5 shows the procedure by which the object 
code to be executed by the processor is generated 
by a compiler, optimization apparatus, assembler, 
and linker; 

Fig. 6 is a block diagram showing the details of the to 

processor 309 and the external memory; 

Fig. 7 is an increment table showing the rules used 

to increment the in-packet address; 

Fig. 8A is an addition table showing the addition 

rules used when adding the lower 3 bits of the ad- is 

dress of a branch instruction to lower 3 bits of the 

PC relative value; 

Fig. 8B is a subtraction table showing the subtrac- 
tion rules used when subtracting the lower 3 bits of 
the PC relative value from the lower 3 bits of a 20 
branch destination address; 
Fig. 9 is a block diagram showing the components 
and input/output data of the optimization apparatus 
303; 

Fig. 1 0 is a flowchart showing the operation proce- 25 
dure of the optimization apparatus; 
Fig. 11 shows part of the optimization processing 
code 903 generated by the code optimization appa- 
ratus 902; 

Fig. 12 shows the address assigned codes 91 6 gen- 30 
erated from the optimization processing code 903 
shown in Fig. 11; 

Fig. 13 shows the label information 906 generated 
from the address assigned codes 91 6 shown in Fig. 
12; 35 
Fig. 14 shows the optimized code 304 generated 
from the address assigned codes 91 6 shown in Fig. 
12; 

Fig. 1 5 is a block diagram that shows the construc- 
tion of the assembler 305 shown in Fig. 5 and the 40 
input/output data related to the assembler 305; 
Fig. 16 is a flowchart showing the operation of the 
assembler; 

Fig. 1 7 shows the machine language codes 803 that 
are generated from the optimized code 304 shown 45 
in Fig. 14; 

Fig. 1 8 shows the label information that is generat- 
ed from the machine language codes shown in Fig. 
17; 

Fig. 19 shows the relocatable codes that are gen- so 
erated from the machine language codes 803 
shown in Fig. 17; 

Fig. 20 is a block diagram showing the construction 
of the linker 307 and the I/O (input/output) data of 
the linker 307; ss 
Fig. 21 is a flowchart showing the operation of the 
linker 307; 

Fig. 22 shows the relocatable codes; 



Fig. 23 shows the state when the relocatable codes 
814 shown in Fig. 19 have been combined with the 
relocatable code shown in Fig. 22; 
Fig. 24 shows the resulting combined codes 703; 
Fig. 25 shows the label information that is generat- 
ed from the combined codes 703 shown in Fig. 24; 
Fig. 26 shows the object codes generated from the 
combined codes 703 shown in Fig. 24; 
Fig. 27 shows the object code generated by the sec- 
ond embodiment of the present invention; 
Fig. 28A shows the construction of an instruction 
packet in the third embodiment; 
Fig. 28B shows the types of instructions used in the 
third embodiment; 

Fig. 28C shows the relation between in-packet ad- 
dresses and the instruction units in a packet; 
Fig. 29A is an addition table showing the addition 
rules for adding the lower 3 bits of the address of 
the branch instruction and the lower 3 bits of the PC 
relative value in the calculation method of the fourth 
embodiment that does not use a carry; 
Fig. 29B is a subtraction table showing the subtrac- 
tion rules for subtracting the lower 3 bits of the ad- 
dress of the branch instruction from the lower 3 bits 
of the address of the branch destination instruction 
in the calculation method of the fourth embodiment 
that does not use a carry; 
Fig. 30 shows the object code that is generated by 
the address calculation method of the fourth em- 
bodiment that does not use a carry; 
Fig. 31 A is an addition table showing the addition 
rules for adding the lower 3 bits of the address of 
the branch instruction and the lower 3 bits of the PC 
relative value in the calculation method of the fifth 
embodiment that uses absolute values; 
Fig. 31 B is a subtraction table showing the subtrac- 
tion rules for subtracting the lower 3 bits of the ad- 
dress of the branch instruction from the lower 3 bits 
of the address of the branch destination instruction 
in the calculation method of the fifth embodiment 
that uses absolute values; 

Fig. 32 shows the object code that is generated by 
the above address calculation method of the fifth 
embodiment that uses absolute values; 
Fig. 33 shows the object code that has been gen- 
erated using the linear calculation method of the 
sixth embodiment; 

Fig. 34 shows the processor of the seventh embod- 
iment; 

Fig. 35A shows the operation that corresponds to a 
PC adding instruction which is shown in mnemonic 
form; 

Fig. 35B shows the operation that corresponds to a 
PC subtracting instruction which is shown in mne- 
monic form; 

Fig. 36 shows the construction of the compiler of 
the eighth embodiment of the present invention; 
Fig. 37 is a flowchart showing the operation of the 
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compiler; 

Fig. 38 shows source code which is written in C lan- 
guage; 

Fig. 39 shows the intermediate codes that have 
been generated from the source program shown in 
Fig. 38; 

Fig. 40 shows the assembler code that has been 
produced by converting the intermediate codes 
shown in Fig. 39; 

Fig. 41 is a block diagram showing the construction 
of the debugger and disassembler of the present 
embodiment; 

Fig. 42 is a flowchart showing the operating proce- 
dure of a disassembler of the present invention; and 
Fig. 43 is a flowchart showing the operation of the 
debugger of the present invention. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[0076] The following is a detailed description of sev- 
eral embodiments of the present invention, with refer- 
ence to the accompanying drawings. 

First Embodiment 

[0077] This first embodiment relates to an optimiza- 
tion apparatus, an assembler, and a linkerthat generate 
programs where read operations and execute opera- 
tions have different units, and to a processor for execut- 
ing such programs. 

Formats of the Instructions Executed by the Processor 

[0078] The following explains the formats of the in- 
structions executed by the processor of this first embod- 
iment. These formats are shown in Figs. 2A ~ 2E. The 
instructions executed by the present processor are con- 
structed so that 21 bits is set as one instruction unit. For 
the present processor, there are both one-unit (i.e., 
21 -bit) and two-unit (i.e., 42-bit) instructions. 
[0079] The format information 101 is written as one 
bit and shows the length of each instruction. When the 
format information 101 is "0", this shows that the unit 
including this format information 101 forms one com- 
plete instruction, which is to say, a 21 -bit instruction. 
When the format information 101 is "1 ", this shows that 
the unit includingthis format information 101 and the fol- 
lowing unit together form one two-unit instruction, which 
is to say, a 42-bit instruction. 

[0080] The parallel execution boundary information 
1 00 is also written as one bit and shows whether a par- 
allel execution boundary exists between the instruction 
formed by the present unit and the following instruction. 
When the parallel execution boundary information 100 
is "1 ", this shows that a parallel execution boundary ex- 
ists between the instruction including this parallel exe- 
cution boundary information 100 and the following in- 



struction, so that these instructions will be executed in 
different cycles. When the parallel execution boundary 
information 100 is "0", this shows that no parallel exe- 
cution boundary exists between the instruction including 
5 this parallel execution boundary information 1 00 and the 
following instruction, so that these instructions will be 
executed the same cycle. 

[0081] The remaining bits in each instruction are used 
to show an operation. This means that 1 9 bits can be 

10 used to indicate the operation in a 21 -bit instruction and 
that 40 bits can be used to indicate the operation in a 
42-bit instruction. The fields marked M Op1", M Op2", 
"Op3", and "Op4" are used to store opcodes that indi- 
cate the type of operation to be performed. The field 

15 marked "Rs" is used to store the register number of a 
register used as the source operand and the field 
marked "Rd" is used to store the register number of a 
register used as the destination operand. The fields 
marked "imm5" and "imm32" are respectively used to 

20 store 5-bit and 32-bit immediates that are used in cal- 
culations. Finally, the fields marked "disp13" and 
"disp32" are respectively used to store 1 3-bit and 32-bit 
displacements. 

[0082] Transfer instructions and arithmetic instruc- 
ts tions that handle long (such as 32-bit) constants and 
branch instructions that use large displacements are de- 
fined as 42-bit instructions. Most other instructions are 
defined as 21 -bit instructions. Of the two units used to 
compose a 42-bit instruction, the latter unit is only used 
30 to store part of the long constant or displacement, and 
so does not store the opcode of the instruction. 

Reading and Execution of Instructions by the Processor 

35 [0083] The following explains the operation of the 
present processor when reading and executing instruc- 
tions. Note that the processor of the present embodi- 
ment has a premise that static parallel scheduling is 
used. Fig. 3A shows an instruction packet that is the unit 
40 used for storing and reading instructions. Each instruc- 
tion packet is composed of three instruction units (63 
bits) and dummy data (1 bit). In each cycle, the proces- 
sor reads instructions using this fixed 64-bit packet 
length. Packets of this size are used because the 21 -bit 
45 unit size of instruction is not suited to reading from mem- 
ory. Accordingly, a number of such instructions are read 
together with dummy data to make the total packet size 
equal to an integer number of bytes. In this example, 
since the number of instruction units in each instruction 
so packet is not a power of two, there is the following spe- 
cial effect. This effect overcomes the problems that oc- 
cur when positions of the units inside instruction packets 
are expressed using binary. In the following explanation, 
the three units in an instruction packet are called the 
55 first, second and third units in order starting from the unit 
with the lowest address value. 

[0084] Fig. 3B shows the read order of instructions. 
As shown in the figure, one instruction packet is read in 
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each cycle. 

[0085] Fig. 3C shows the execution order of instruc- 
tions. In each cycle, instructions are executed as tar as 
the next parallel execution boundary. This means that 
the instructions are executed up to and including an in- 
struction whose parallel execution boundary information 
100 is "1". Instruction units that are read but not execut- 
ed are accumulated in the instruction buffer, and are ex- 
ecuted in a later cycle. 

[0086] As described above, the processor of the 
present embodiments reads instructions using packets 
of a fixed length, but only executes a suitable number 
of units in each cycle depending on parallelism of the 
instructions. The reason that the present processor can 
start the execution of instructions in one cycle at any of 
the instruction units in an instruction packet is that an 
in-packet address specifies an instruction unit in an in- 
struction packet. This is described in more detail later. 
[0087] Fig. 4 shows an example of the methods used 
by a conventional processor to store and read instruc- 
tions that are not byte-aligned. When 21 -bit instructions 
that are not byte-aligned are to be read in byte-units, 
three unused bits have to be added to the end of each 
instruction to make the instruction length 24-bits. This 
means that what are essentially 21 -bit instructions are 
stored into and read from memory in 24-bit units. The 
length of three of such instructions is 72 bits, so that the 
storage of three instructions in a 64-bit packet in the 
present embodiment reduces overall program size. 
[0088] Note that while the present embodiment de- 
scribes the packet construction when 21 -bit instructions 
are used, the invention is not limited to this instruction 
length. It is equally possible to construct instruction 
packets of instructions of a different length and to read 
the instructions using such instruction packets. As one 
example, when instructions are n-bits long, values of m 
and r may be selected so as to give a maximum value 
of n*m+(n*m+r) subject to (n*m+r)mod8=0. One packet 
is then composed of m instruction units (each being n 
bits long) and r-bit dummy data. By doing so, instruction 
packets can be composed of multiple-byte size using 
relatively little dummy data. 

Method for Expressing Instruction Addresses 

[0089] The following explains the method used to ex- 
press instruction addresses in the present embodiment. 
Here, an instruction address refers to the address used 
to specify the position of a unit and is expressed as 32 
bits. 

[0090] The upper 29-bits of a 32-bit address are used 
to specify an instruction packet and so are called the 
"packet address". This packet address is expressed as 
a 29-bit hexadecimal figure in a format such as 
"29'h01 234567". A value produced by shifting the value 
of this packet address by 3-bits to the left is the memory 
address at which the instruction packet is stored. 
[0091] The lower 3-bits in a 32-bit address are used 
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to specify an instruction unit included in the instruction 
packet and so are called the "in-packet address". This 
in-packet address is expressed as a 3-bit binary value 
inaformatsuch as "3'b001 ". As examples, the in-packet 

5 address "3'b001 " specifies the first unit in an instruction 
packet, the in-packet address "3'b010" specifies the 
second unit, and the in-packet address "3'b100" speci- 
fies the third unit. However, the in-packet addresses are 
not limited to these specific values. Other values may 

10 be used provided that the instruction units in an instruc- 
tion packet are each specified using their own value. 
[0092] The indicating of addresses in this embodi- 
ment is such that only 3 bits are assigned for eight-bytes 
of instructions. This gives the same results as when a 

15 conventional processor assigns a separate address to 
each byte, since the upper 29-bits of addresses as- 
signed to eight-bytes of instructions will be the same. 

Method for Generating the Object Code Executed by the 
20 Processor 

[0093] The following explains the method for gener- 
ating the object code that is executed by the processor 
of the present embodiment. 
25 [0094] First, the terminology to be used in this expla- 
nation is defined. 

[0095] A "PC relative value" is the difference between 

the addresses of two instructions. 

[0096] A "label" is either an "instruction address-re- 

30 solved label" or a "PC relative value-resolved label". Ab- 
solute address-resolved labels are replaced with abso- 
lute addresses of instructions during the processing that 
converts a program into object code. An example of 
such a label is the label "L2" in the transfer instruction 

35 "mov L2,r1 " that transfers an instruction stored in mem- 
ory to the register r1 . PC relative value-resolved labels 
are replaced with PC relative values during the process- 
ing that converts a program into object code. An exam- 
ple of such a label is the label "L1 " in the unconditional 

40 branch instruction "bra L1 " that performs an uncondi- 
tional branch using the PC relative value. "Local labels" 
and "external labels" also exist as other types of label. 
When a label and the instruction including the label are 
included in the same module (a module being a subpro- 

45 gram composed of an instruction sequence achieving 
one processinq function), such label is called a local la- 
bel, while when the label and instruction including the 
label are included in different modules, such label is 
called an external label. 

so [0097] Fig. 5 shows the procedure by which the object 
code to be executed by the processor is generated by 
acompiler, optimization apparatus, assembler, and link- 
er. An overview of the functions of these components is 
given below. 

55 [0098] The compiler 301 analyzes the content of the 
source code 300 that is written in a high-level language 
like C and outputs assembler code 302. 
[0099] The optimization apparatus 303 assigns tem- 
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porary addresses to the assembler code 302, links the 
instruction sequences in groups of three instruction 
units, and outputs optimized code 304 as the linked re- 
sults. In this process, local labels are calculated as PC 
relative values or instruction addresses. The instruction 
size, which is to say, whether an instruction should be 
expressed as a one-unit instruction or as a two-unit in- 
struction, is then determined based on the value of the 
PC relative value or the instruction address. 
[0100] The assembler 305 outputs relocatable codes 
306 which it generates from the optimized code 304. 
This processing converts local labels that should be re- 
solved with PC relative values into PC relative values. 
[0101] The linker 307 combines a plurality of modules. 
That is, the linker 307 combines a plurality of relocatable 
codes 306 and outputs the resulting object code 308. In 
this processing, unresolved labels are converted into 
PC relative values or instruction addresses. 
[0102] The processor 309 executes the object code 
308. 

[0103] As described above, a program written in a 
high-level language is converted by the compiler 301, 
the optimization apparatus 303, the assembler 305, and 
the linker 307 into object code that is in a format exe- 
cutable by the processor. Each label in the program is 
converted into a PC relative value or an instruction ad- 
dress by one of the steps in the above procedure. Ad- 
dress resolution for local labels that should be resolved 
by a PC relative value is performed by the assembler 
305. Address resolution for local labels that should be 
resolved by an instruction address and address resolu- 
tion for external labels are performed by the linker 307. 
[0104] The following describes the construction and 
operation of the processor 309, the linker 307, the as- 
sembler 305, and the optimization apparatus 303 shown 
in Fig. 4. 

Processor 

[0105] Fig. 6 is a block diagram showing the details 
of the processor 309 and the external memory. 
[0106] The processor 309 is capable of executing a 
maximum of three instructions in parallel. This proces- 
sor 309 includes calculators 401a ~ 401c, general reg- 
isters 402, an upper PC 403, a lower PC 404, an upper 
PC calculator 411, a lower PC calculator 405, an INC 
412, an instruction buffer 408, an prefetch upper counter 
410, a prefetch lower counter 413, instruction decoder 
409a ~ 409c, a PC relative value selector 420, an im- 
mediate selector 421 , an operand data buffer 423, and 
an operand address buffer 422. The external memory 
includes the data memory 406 and the instruction mem- 
ory 407. 

[0107] In the following explanation, the upper PC 403 
and the lower PC 404 will be collectively referred to as 
the "PC", and the upper PC calculator 411 and the lower 
PC calculator 405 will be collectively referred to as the 
"PC calculator". 



[01 08] The f i rst calcu lator 40 1 a, the second calcu lator 
401b, and the third calculator 401c each perform one 
calculation. These calculators are capable of calculating 
at the same time. 
5 [0109] The general registers 402 store data, address- 
es and other data. 

[0110] The upper PC 403 stores the upper 29 bits of 
the address of the first instruction in a set of instructions 
to be executed in the next cycle, which is to say, a packet 
to address. 

[0111] The lower PC 404 stores the lower 3 bits of the 
address of the first instruction in a set of instructions to 
be executed in the next cycle, which is to say, an in- 
packet address. 

15 [0112] The instruction memory 407 stores instructions 
that are expressed by the object code 308. 
[0113] The instruction buffer 408 stores instructions 
that have been read from the instruction memory 407. 
[011 4] The first instruction decoder 409a, the second 

20 instruction decoder 409b, and third instruction decoder 
409c decode instructions and, if the respective instruc- 
tions are executable, give indications to other compo- 
nents in the processor to have the instructions executed. 
The first instruction decoder 409a receives an input of 

25 the first instruction stored in the instruction buffer 408, 
the second instruction decoder 409b an input of the next 
instruction, and the third instruction decoder 409c an in- 
put of a next instruction. These instruction decoders 
409a ~ 409c investigate whether there is a parallel ex- 

30 ecution boundary between the instruction units and only 
have the instructions that should be executed in the 
present cycle executed. As one example, when an in- 
struction performs a calculation using a constant, the 
constant is sent to the first calculator 401a via the im- 

35 mediate selector 421 and the first calculator 401 a is in- 
structed to perform the calculation. For a branch instruc- 
tion, a PC relative value is sent via the PC relative value 
selector 420 to the lower PC calculator 405 and upper 
PC calculator 411 that are then instructed to update the 

40 pc The instruction decoders 409a ~ 409c send control 
signals showing the number of executed instruction 
units to have the INC 412 update the PC increment, and 
send control signals showing the number of executed 
instruction units to the instruction buffer 408 to have the 

45 executed instruction units deleted from the instruction 
buffer 408. 

[011 5] The PC relative value selector 420 outputs the 
PC relative value outputted by the instruction decoders 
409a ~ 409c to the lower PC calculator 405 and the up- 
so per PC calculator 411. 

[0116] The immediate selector 421 outputs an imme- 
diate outputted by the instruction decoders 409a ~ 409c 
to the general registers 402 and the calculators 401a ~ 
401c. 

55 [0117] The INC 412 receives information regarding 
the number of executed instruction units via control sig- 
nals sent by the instruction decoders 409a ~ 409c, and 
increments the value of the upper PC 403 and the lower 
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PC 404 in accordance with this number. By doing so, 
the INC 41 2 sets the packet address of the first instruc- 
tion in the set of instructions to be executed in the next 
cycle in the upper PC 403 and the in-packet address of 
the first instruction in the set of instructions to be exe- 
cuted in the next cycle in the lower PC 404. 
[0118] The upper PC calculator 411 and lower PC cal- 
culator 405 respectively update the upper PC 403 and 
the lower PC 404. When a branch instruction is decoded 
by the instruction decoders 409a ~ 409c, the upper PC 
calculator 411 and lower PC calculator 405 respectively 
receive the upper 29 bits and the lower 3 bits of the PC 
relative value included in the branch instruction of the 
PC relative value. The lower PC calculator 405 increas- 
es or decreases the present value of the lower PC 404 
by the lower 3 bits in the PC relative value and sends 
the calculation result to the lower PC 404 as the new 
lower PC. The upper PC calculator 411 increases or de- 
creases the present value of the upper PC 403 by the 
upper 29 bits in the PC relative value and sends the cal- 
culation result to the upper PC 403 as the new upper 
PC. This operation of the PC calculators is described 
later in this specification. As described above, when a 
branch instruction is executed, the packet address of the 
branch destination instruction that is to be executed next 
is set in the upper PC 403 and the in-packet address is 
set in the lower PC 404. There are also cases where the 
upper PC calculator 411 and lower PC calculator 405 
update the PC by calculating an address using a PC rel- 
ative value and an address stored in the general regis- 
ters 402. 

[0119] The prefetch upper counter 410 shows the up- 
per 29 bits of the address of the first instruction in the 
set of instructions to be read from the instruction mem- 
ory 407, which is to say, the packet address. The 
prefetch upper counter 410 normally increments this 
value by one in each cycle. When a branch instruction 
was executed in the previous cycle, the packet address 
of the branch destination instruction set in the upper PC 
403 is sent to the prefetch upper counter 410 where it 
is set in place of the present value in the prefetch upper 
counter 410. 

[0120] The prefetch lower counter 41 3 shows the low- 
er 3 bits of the address of the first instruction in the set 
of instructions read from the instruction memory 407, 
which is to say, the in-packet address. In this embodi- 
ment, the value "3'b000" is set in the prefetch lower 
counter 41 3. As a result, the instructions to be read are 
indicated in packet units, so that one packet is sent from 
the instruction memory 407 to the instruction buffer 408 
in each cycle. 

[0121] The data memory 406 stores operand data. 
[0122] The operand data buffer 423 and operand ad- 
dress buffer 422 are buffers that are located between 
the data memory 406 and the processor. 
[0123] The following explains the incrementing meth- 
od and calculating method for instruction addresses. 
Th is is the most characteristic feature of the present em- 



bodiment. 

Incrementing Method for Instruction Addresses 

5 [0124] The incrementing of addresses is performed 
by adding an increment value to the in-packet address 
of an instruction, and adding any carry produced by the 
addition to the packet address. 

[0125] Fig. 7 is an increment table showing the rules 

10 used to increment the in-packet address. As shown in 
the figure, when the in-packet address is "3'b000" or 
"3'b010", the incrementing of the instruction address is 
performed by adding 2 to the in-packet address. When 
the in-packet address is "3'b100", a carry to the packet 

15 address is produced (which is to say, 1 is to be added 
to the upper 29 bits of the instruction address) and the 
in-packet address is updated to "3'b000". This means 
that the incrementing of the in-packet address is a cal- 
culation that cycles through the three values "3'b000", 

20 "3'bOIO", and "3'b100". As one example, when the in- 
crement value is "2" and the value of the in-packet ad- 
dress before incrementing is "3'b100", the packet ad- 
dress after incrementing is "3'b010" and a carry of "1 " 
to the packet address is generated. 

25 [0126] Note that in the present embodiment, the in- 
packet address does not need to be expressed in binary. 
This is especially effective when the number of instruc- 
tion units in an instruction packet is not a power of 2. 
When this is the case, it is not possible to express the 

30 position of an instruction unit in an instruction packet in 
binary and use a binary calculation to shift the position 
of an instruction unit. However, in the present embodi- 
ment, the position of an instruction unit in an instruction 
packet is expressed using m different values. By using 

35 a calculation that cycles through these m values, the 
specifying of instruction units and the calculations for 
shifting the instruction position can be achieved even if 
the number of instruction units in an instruction packet 
is not a power of 2. 

40 

Method for Calculating the Instruction Address 

[0127] The following explains the carry method which 
is one of the methods used for calculating the instruction 

45 addresses in the present invention. Other methods used 
to calculate addresses are a separation method, an ab- 
solute position indicating method, and a linear address- 
ing method, though these will be described later in this 
specification. In the carry method, the upper 29 bits and 

so lower 3 bits of an instruction address are calculated sep- 
arately. However, when calculating the upper bits, any 
carry to or from the upper 29 bits that occurred when 
calculating the lower 3 bits is taken into account. 
[0128] The following explains the method by which 

55 the present processor adds the address of a branch in- 
struction and a PC relative value to find a branch desti- 
nation address. The lower PC calculator 405 shown in 
Fig. 6 adds the lower 3 bits of the address of a branch 
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instruction to the lower 3 bits of the PC relative value. 
Fig. 8A is an addition table showing the addition rules 
used when adding the lower 3 bits of the address of a 
branch instruction to lower 3 bits of the PC relative value. 
As shown in Fig. 8A, this addition of the lower 3-bit val- 
ues differs from a binary calculation in being a calcula- 
tion that cycles through the three values "3'bOOO", 
"3'b010", and "3'b100". When a carry occurs as shown 
in Fig. 8A, the lower PC calculator 405 sends the carry 
to the upper PC value to the upper PC calculator 411. 
[0129] The upper PC calculator 411 shown in Fig. 6 
adds the upper 29 bits of the address of a branch in- 
struction to the upper 29 bits of the PC relative value. 
When doing so, if the calculation of the lower PC calcu- 
lator 405 has resulted in a carry to the upper PC, the 
upper PC calculator 411 also adds this carry. This addi- 
tion is a normal addition of binary values. 
[0130] The addition results of the lower PC calculator 
405 and upper PC calculator 411 form the address of 
the branch destination instruction. The addition result 
for the lower 3 bits is set in the lower PC 404 and the 
addition result for the upper 29 bits is set in the upper 
PC 403. 

[0131] The following explains the calculations of the 
optimization apparatus 303, assembler 305, and linker 
307 for finding the PC relative value, which is to say the 
subtraction of the branch instruction address from the 
branch destination address. Like the addition described 
above, this subtraction is performed separately for the 
upper 29 bits and lower 3 bits. The lower address sub- 
traction means 907 of the optimization apparatus 303, 
the lower address subtraction means 806 of the assem- 
bler 305, and the lower address subtraction means 706 
of the linker 307 subtract the lower 3 bits of the branch 
instruction address from the lower 3 bits of the branch 
destination address. Fig. 8B is a subtraction table show- 
ing the subtraction rules used when subtracting the low- 
er 3 bits of the PC relative value from the lower 3 bits of 
a branch destination address. As shown in Fig. 8B, this 
subtraction of the lower 3-bit values differs from a binary 
calculation in being a calculation that cycles through the 
three values "3'bOOO", "3'b010", and "3'b100". When a 
carry occurs as shown in Fig. 8B, the lower address sub- 
traction means that performs the calculation (such as 
lower address subtraction means 907) sends the carry 
from the upper PC value to the corresponding upper ad- 
dress subtraction means (such as upper address sub- 
traction means 910). The various upper address sub- 
traction means are described in more detail later. 
[0132] The upper address subtraction means 910 in 
the optimization apparatus 303, the upper address sub- 
traction means 809 in the assembler 305, and upper ad- 
dress subtraction means 709 in the linker 307 subtract 
the upper 29 bits of the address of a branch instruction 
from the upper 29 bits of the address of the branch des- 
tination instruction. When doing so, if the calculation of 
the lower address subtraction means 907 (or similar) 
has resulted in a carry from the upper PC, the upper 



address subtraction means 910 (or similar) also sub- 
tracts this carry. This subtraction is a normal subtraction 
of binary values. 

[0133] These subtraction results respectively form the 

5 lower 3 bits and the higher 29 bits of the PC relative 
value. This method is also used when the processor 
finds the address of a branch destination instruction by 
executing a subtraction on the address of a branch in- 
struction and a PC relative value. 

to [0134] The optimization apparatus 303, assembler 
305, and linker 307, which calculate a PC relative value 
from the difference between the address of a branch 
destination instruction and the address of a branch in- 
struction, and the processor 309, which calculates the 

15 addressof a branch destination instruction using this PC 
relative value, calculate addresses using the same carry 
method. As a result, when executing a branch instruc- 
tion, the processor can correctly calculate the address 
of a branch destination instruction from the PC relative 

20 value. This address calculation method that uses a carry 
has a feature in that it can calculate addresses perform 
separate calculations for upper bits and lower bits while 
maintaining the continuity between the two. 

25 Optimization Apparatus 

[0135] Fig. 9 is a block diagram showing the compo- 
nents and input/output data of the optimization appara- 
tus 303 shown in Fig. 5. This optimization apparatus 303 

30 optimizes the assembler code 302 generated by the 
compiler 301, links the instruction sequences together 
in packets of three instruction units, and outputs the re- 
sulting optimized code 304. The optimization apparatus 
303 includes a code optimization apparatus 902, an ad- 

35 dress assigning means 904, a label detecting means 
905, a lower address subtraction means 907, an upper 
address subtraction means 910, an address difference 
calculating means 912, and a label information resolving 
means 914 

40 [0136] The code optimization apparatus 902 optimiz- 
es the assembler code 302 and so generates the opti- 
mization processing code 903. This processing of the 
code optimization apparatus 902 is the same as any 
well-known optimization apparatus, and so will not be 

45 described. 

[0137] The address assigning means 904 estimates 
an address for each instruction in the optimization 
processing code 903 produced by the code optimization 
apparatus 902 and assigned an estimated address to 

so each instruction. These addresses are called provision- 
al addresses in this specification. As a result, the ad- 
dress assigning means 904 outputs the address as- 
signed codes 916 

[0138] The label detecting means 905 detects local 
55 labels from the address assigned codes 91 6. On detect- 
ing a label that should be resolved by an instruction ad- 
dress, the label detecting means 905 obtains the provi- 
sional address of the instruction including this label. 
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Conversely, on detecting a label that should be resolved 
by a PC relative value, the label detecting means 905 
obtains the provisional addresses of the instruction in- 
cluding this label and the branch destination instruction. 
After this, the label detecting means 905 outputs the la- 
bel information 906 that shows the instructions that in- 
clude labels and information on values for resolving 
these labels. 

[0139] The lower address subtraction means 907, the 
upper address subtraction means 910, and the address 
difference calculating means 912 calculate the PC rel- 
ative values for labels, in the label information 906, that 
should be resolved by PC relative values. 
[0140] The lower address subtraction means 907 
subtracts the lower 3 bits of the provisional address of 
a branch instruction from the lower 3 bits of the provi- 
sional address of the branch destination instruction and 
outputs the resulting carry value 908 and lower subtrac- 
tion result 909. 

[0141] The upper address subtraction means 910 
subtracts the upper 29 bits of the provisional address of 
a branch instruction and the carry value 908 calculated 
by the lower address subtraction means 907 from the 
upper 29 bits of the provisional address of the branch 
destination instruction and outputs the resulting upper 
subtraction result 911. 

[0142] The address difference calculating means 91 2 
finds the address difference 913 by setting the lower 
subtraction result 909 calculated by the lower address 
subtraction means 907 as the lower 3 bits and the upper 
subtraction result 911 calculated by the upper address 
subtraction means 91 0 as the upper 29 bits. 
[0143] The label information resolving means 914 
converts an instruction in the optimization processing 
code 903 including the present label into an instruction 
of a suitable size, based on an address that was esti- 
mated and assigned by the address assigning means 
904 or the address difference 91 3 found by the address 
difference calculating means 912. If the assigned ad- 
dress or the address difference 913 can be expressed 
using no more than 1 3 bits, the label information resolv- 
ing means 91 4 converts the instruction into a 21 -bit in- 
struction, or if not the label information resolving means 
914 converts the instruction into a 42-bit instruction. 
[0144] After the labels have been resolved, the label 
information resolving means 914 links the instruction se- 
quences into packets of three instruction units and out- 
puts the result as the optimized code 304. 
[0145] The following describes a specific operation of 
the optimization apparatus 303. 
[0146] Fig. 10 is a flowchart showing the operation 
procedure of the optimization apparatus. 
[0147] First, the code optimization apparatus 902 op- 
timizes the assembler code 302 and generates optimi- 
zation processing code 903. Part of the optimization 
processing code 903 generated by the code optimiza- 
tion apparatus 902 is shown in Fig. 11. Of the instruc- 
tions in Fig. 11, "L1 :mov r2,r1 " 1000 shows the position 



of the label L1 and is an instruction that indicates a trans- 
fer from register r2 to register r1 . The instruction "jsr f " 
is a function call that performs a relative branch to the 
label f (an external label). A return from the function call 

5 to this address is performed by a "ret" instruction. The 
instruction "add r0,r4" adds the values of registers rO 
and r4 and stores the result in register r4. The instruction 
"and r1,r3" 1003 calculates a logical AND for the values 
in register r1 and r3 and stores the result in register r3. 

10 The instruction "mov L2,r2" 1004 transfers the address 
of the instruction located at the label L2 into the register 
r2. The instruction "1d (r2),r0" 1005 transfers the data 
stored at the address stored in register r2 into the reg- 
ister rO. The instruction "bra L1" 1006 performs an indi- 

15 rect branch to the label L1 (a local label) Note that in 
Fig. 11, the instructions that continue after instruction 
1007 have been omitted, though these instructions do 
not include an instruction located at the label f (step 
S9001). 

20 [0148] The address assigning means 904 assigns a 
provisional address to each instruction in the optimiza- 
tion processing code 903 and so generates address as- 
signed codes 916. Fig. 12 shows the address assigned 
codes 916 generated from the optimization processing 

25 code 903 shown in Fig. 11 . In this example, provisional 
addresses starting from the value "32'b00000800" have 
been assigned (step S9002). 

[0149] The label detecting means 905 detects local 
labels in the address assigned codes 916 and outputs 

30 label information 906 composed of instructions that in- 
clude the detected labels and information on the values 
used to resolve those labels. Fig. 13 shows the label 
information 906 that is generated from the address as- 
signed codes 916 shown in Fig. 12. As shown in this 

35 figure, label L2 of instruction 1 1 04 is detected as a label 
that should be resolved by an instruction address and 
label L1 is detected as a label that should be resolved 
by a PC relative value. Information showing the address 
for resolving the label L2 is appended to the instruction 

40 "mov L2,r2" that includes the label 12, and information 
showing the addresses of the branch destination in- 
struction and branch instruction to be used for calculat- 
ing a PC relative value is appended to the instruction 
"bra L1" that includes the label L1. Note that since the 

45 label f in instruction 1101 is an external label, it is not 
optimized (steps S9003, S9004). 
[0150] When the label information 906 includes a la- 
bel that should be resolved by a PC relative value, 
processing to calculate this PC relative value is per- 

so formed. The lower address subtraction means 907 cal- 
culates the lower 3 bits of the value shown by the label 
L1 that is a PC relative value. The lower address sub- 
traction means 907 subtracts the lower 3 bits "3'b010" 
of the provisional address "32'h00000812" of the branch 

55 instruction 1 1 06 from the lower 3 bits "3'bOOO" of the pro- 
visional address "32'h00000800" of the branch destina- 
tion instruction 1100. As a result, "1 " is obtained as the 
carry value 908, and "3'b100" is obtained as the lower 
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subtraction result 909 (steps S9005, S9006). 
[0151] The upper address subtraction means 910 cal- 
culates the upper 29 bits of the value shown by the label 
L1 that is a PC relative value. The upper address sub- 
traction means 910 subtracts the upper 29 bits 
"29'h000001 02" of the provisional address of the branch 
instruction 1 1 06 and the carry value 908 "1 " generated 
by the lower address subtraction means 907 from the 
upper 29 bits "29'h000001 00" of the provisional address 
of the branch destination instruction 1100. As a result, 
"29'h1ffffffd" ("-3" in base 10, minus numbers being 
hereafter shown using a complement) is obtained as the 
upper subtraction result 911 (step S9007). 
[0152] The address difference calculating means 91 2 
finds the address difference, which is to say the PC rel- 
ative value, by setting the lower subtraction result 909 
as the lower bits and the upper subtraction result 911 as 
the upper bits. In this example, the address difference 
calculating means 912 sets "3'b100" as the lower bits 
and "29'h1ffffffd" as the upper bits, giving an address 
difference of "32'hffffffec" (step S9008). 
[0153] The label information resolving means 914 
judges whether the value used to resolve the label in the 
label information 906 can be expressed by a 13-bit val- 
ue. The value that resolves the label L2 shown in Fig. 
13 is "32'h1 2345678", so that this value cannot be ex- 
pressed as a 13-bit value, meaning that instruction 1104 
including this label L2 will become a 42-bit instruction. 
On the other hand, the value used to resolve label L1 is 
"32'hffffffec", which can be expressed by a 1 3-bit value. 
Accordingly, the instruction 1106 that includes label L1 
will become a 21 -bit instruction (steps S9009, S9010, 

59011) . 

[0154] The label information resolving means 914 
links the instruction sequences into packets of three in- 
struction units, based on the address assigned codes 
916. When doing so, the label information resolving 
means 914 converts instructions that include labels into 
instructions of the determined size. Here, one instruc- 
tion unit is used for 21 -bit instructions, and two units are 
used for 42-bit instructions. After this, the label informa- 
tion resolving means 914 outputs the instruction se- 
quences that it has converted into packets as the opti- 
mized code 304. Fig. 14 shows the optimized code 304 
generated from the address assigned codes 916 shown 
in Fig. 12. In Fig. 14, each row shows the instructions 
that form one instruction packet, with the marks "II" 
showing the boundaries between instructions in a pack- 
et. Curved brackets "()" are used in this drawing to indi- 
cate 42-bit instructions that each occupy two units (step 

59012) . 

[0155] As described above, addresses are estimated 
with a calculation method that uses a carry. In this way, 
a suitable optimization apparatus for a processor that 
uses a carry method can be achieved. 
[0156] Note that the provisional addresses assigned 
by the address assigning means 904 and the PC relative 
values calculated by the address difference calculating 



means 912 are values that are estimated for determin- 
ing the sizes of all instructions that include labels. There 
are cases when these estimates differ from the actual 
values, so that these values are not used hereafter in 
5 the processing. 

Assembler 

[0157] Fig. 15 is a block diagram that shows the con- 

10 struction of the assembler 305 shown in Fig. 5 and the 
input/output data related to the assembler 305. This as- 
sembler 305 converts the optimized code 304 generat- 
ed by the optimization apparatus 303 into relocatable 
codes 306 that have a relocatable address format. The 

15 assembler 305 includes a machine language code gen- 
erating means 802, a label detecting means 804, a lower 
address subtraction means 806, an upper address sub- 
traction means 809, an address difference calculating 
means 811, and a label information resolving means 

20 813. The machine language code generating means 
802 converts the optimized code 304 into machine lan- 
guage codes 803 that can be executed by the processor 
309. However, labels whose values have not been re- 
solved are not converted and are stored in the machine 

25 language codes 803 as they are. The machine language 
code generating means 802 assigns a packet address 
and an in-packet address to each machine language 
code. As described later, the labels are later resolved 
using these addresses. 

30 [0158] The label detecting means 804 finds a label 
that should be resolved by a PC relative value, which is 
to say, a difference in addresses between two instruc- 
tions and obtains the addresses of the branch instruc- 
tion and the branch destination instruction. Afterthis, the 

35 labe I detecting means 804 outputs label information 805 
that is composed of the instructions that include labels 
and the values that resolve these labels. 
[0159] To resolve the label information 805 obtained 
by the label detecting means 804, the lower address 

40 subtraction means 806, the upper address subtraction 
means 809, and the address difference calculating 
means 811 calculate a PC relative value as follows. 
[0160] The lower address subtraction means 806 
subtracts the lower 3 bits of the address of a branch in- 

45 struction from the lower 3 bits of the address of the 
branch destination instruction and outputs the carry val- 
ue 807 and the lower subtraction result 808. 
[0161] The upper address subtraction means 809 
subtracts the upper 29 bits of the address of a branch 

so instruction and the carry value 807 calculated by the 
lower address subtraction means 806 from the upper 29 
bits of the address of the branch destination instruction 
and outputs the resulting upper subtraction result 810. 
[0162] The address difference calculating means 811 

55 finds the address difference 812 by setting the lower 
subtraction result 808 calculated by the lower address 
subtraction means 806 as the lower 3 bits and the upper 
subtraction result 810 calculated by the upper address 
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subtraction means 809 as the upper 29 bits. 
[0163] The label information resolving means 813 re- 
places the labels in the machine language codes 803 
with the address differences 812 calculated by the ad- 
dress difference calculating means 811, and outputs the 
resulting relocatable codes 306. 
[0164] The following explains a specific example of 
the processing of the assembler 305 on receiving an in- 
put of the optimized code 304 of Fig. 14 that has been 
outputted by the optimization apparatus 303. 
[0165] Fig. 16 is a flowchart showing the operation of 
the assembler. 

[0166] First, the machine language code generating 
means 802 converts each packet in the optimized code 
304 into machine language codes 803 that are suited to 
the processor 309. However, the machine language 
code generating means 802 does not convert labels 
whose values have not been resolved, so that these la- 
bels are stored as they are in the machine language 
codes 803. After this, the machine language code gen- 
erating means 802 assigns packet addresses (hereafter 
also called "local packet addresses") and in-packet ad- 
dresses to each instruction in the machine language 
codes 803. Fig. 17 shows the machine language codes 
803 that are generated from the optimized code 304 
shown in Fig. 1 4. Note that the actual machine language 
codes are expressed in binary as sequences of zeros 
and ones, though for ease of understanding these ma- 
chine language codes are shown in Fig. 17 in mnemonic 
form. The parallel execution boundary information 100 
and the format information 101 will also be clear at this 
stage, but are not illustrated to simplify the figure. In Fig. 
17, packet addresses (local packet addresses) are as- 
signed starting from the value "29'hOOOOOOOO". The la- 
bel f in the instruction "jsr f" in packet 1 300, the label L2 
in the instruction "mov L2,r2" in packet 1301, and the 
label L1 in the instruction "bra L1" in packet 1302 have 
not yet been resolved, so that these instructions are not 
converted (steps S1500, S1501). 
[01 67] Next, the label detecting means 804 detects la- 
bels, out of the unresolved labels in the machine lan- 
guage codes 803, which are local labels that should be 
resolved by a PC relative value, and obtains the address 
of the instruction including the label, which is to say, the 
branch instruction, and the address of the branch des- 
tination instruction. The label detecting means 804 then 
outputs label information 805 that includes information 
showing the instruction including the label and the value 
that resolves the label. Fig. 18 shows the label informa- 
tion 805 that is generated from the machine language 
codes shown in Fig. 17. Here, label L1 is detected as a 
local label that should be resolved by a PC relative val- 
ue, "32'h00000012" is obtained as the address of the 
branch instruction, and "32'hOOOOOOOO" is obtained as 
the address of the branch destination instruction (steps 
S1502, S1503). 

[0168] The lower address subtraction means 806 
then calculates the lower bits of the value L1 that is a 



PC relative value. The lower address subtraction means 
806 subtracts the lower 3 bits "3'b01 0" of the address 
"32'h0000001 2" of the branch instruction 1 409 from the 
lower 3 bits "3'bOOO" of the address "32'hOOOOOOOO" of 

5 the branch destination instruction 1401 . As a result, "1 " 
is obtained as the carry value 807 and "3'b100" is ob- 
tained as the lower subtraction result 808 (step S1504). 
[0169] Next, the upper address subtraction means 
809 calculates the upper bits of the value L1 that is a 

to pc relative value. The upper address subtraction 
means 809 subtracts the upper 29 bits "29'h00000002" 
of the address of the branch instruction 1409 and the 
carry value 807 "1" from the upper 29 bits 
"29'hOOOOOOOO" of the address of the branch destination 

15 instruction 1401 As a result, "29'h1ffffffd" ("-3" in base 
10, minus numbers being hereafter shown using a com- 
plement) is obtained as the upper subtraction result 810 
(step S1505). 

[0170] The address difference calculating means 811 

20 finds the address difference, which is to say the PC rel- 
ative value, by setting the lower subtraction result 808 
as the lower bits and the upper subtraction result 810 
as the upper bits. I n this example, the address difference 
calculating means 811 sets "3'b100" as the lower bits 

25 and "29'h1ffffffd" as the upper bits, giving an address 
difference of "32 , hffffffec" (step S1506). 
[0171] The label information resolving means 813 
judges whether the address difference 812 can be ex- 
pressed by only its lower 13 bits. If so, the label infor- 

30 mation resolving moans 81 3 sets the lower 13 bits of the 
address difference 812 as the PC relative value, or if 
not, the label information resolving means 813 sets the 
entire address difference 812 as the PC relative value. 
As a result, a label in the machine language codes 803 

35 is converted into a PC relative value. The address dif- 
ference that resolves label L1 in the label information in 
Fig. 17 is "32'hffffffec", which can be expressed by the 
lower 13-bit value "131 fee", so that the label L1 in the 
machine language codes shown in Fig. 17 is converted 

40 into the lower 13-bit value. Fig. 19 shows the relocatable 
codes that are generated from the machine language 
codes 803 shown in Fig. 17. In Fig. 19, the instruction 
1609 has been produced by converting the label L1 into 
a PC relative value. Fig. 19 shows the parallel execution 

45 boundary information 1 00 and format information 1 01 of 
each instruction that had already been established 
when the machine language codes 803 were outputted, 
and also shows the unused bit in each instruction packet 
(steps S1 507, S1508, S1509). 

so [0172] As described above, by finding a PC relative 
value by performing address calculation according to a 
carry method, an assembler corresponding to a proces- 
sor that uses a carry method can be realized. 

55 Linker 

[0173] Fig. 20 is a block diagram showing the con- 
struction of the linker 307 shown in Fig. 5 and the I/O 
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(input/output) data of the linker 307. This linker 307 com- 
bines a plurality of relocatable codes 701, determines 
the addresses of each instruction, and outputs the ob- 
ject code 714 that is executable by the processor 309 
and is in absolute address format. The linker 307 in- 
cludes the code combining means 702, the relocation 
information detecting means 704, the lower address 
subtraction means 706, the upper address subtraction 
means 709, the address difference calculating means 
711, and the relocation information resolving means 
713 

[0174] The code combining means 702 combines a 
plurality of inputted relocatable codes 701 and deter- 
mines the addresses of all instructions. The code com- 
bining means 702 then resolves the labels that should 
be resolved by instruction addresses using the deter- 
mined addresses and outputs the combined codes 703 
that result from its operation. 

[0175] The relocation information detecting means 
704 searches for external labels that should be resolved 
by PC relative addresses and obtains the addresses of 
branch instructions and the branch destination instruc- 
tions. After doing so, the relocation information detect- 
ing means 704 outputs relocation information 705 in- 
cludes information showing instructions that include la- 
bels and values to be used to resolve the labels. To re- 
solve the resulting relocation information 705, the lower 
address subtraction means 706, the upper address sub- 
traction means 709, and the address difference calcu- 
lating means 711 calculate PC relative values, as de- 
scribed below. 

[0176] The lower address subtraction means 706 
subtracts the lower 3 bits of the address of the branch 
instruction from the lower 3 bits of the address of the 
branch destination instruction, and so generates a carry 
value 707 and a lower subtraction result 708. 
[0177] The upper address subtraction means 709 
subtracts the upper 29 bits of the address of the branch 
instruction and the carry value 707 generated by the 
lower address subtraction means 706 from the upper 29 
bits of the address of the branch destination instruction, 
and so generates the upper subtraction result 710. 
[0178] The address difference calculating means 711 
sets the lower subtraction result 708 calculated by the 
lower address subtraction means 706 as the lower 3 bits 
and the upper subtraction result 710 calculated by the 
upper address subtraction means 709 as the upper 29 
bits to generate the address difference 712. 
[0179] The relocation information resolving means 
713 replaces labels in the combined codes 703 with ad- 
dress differences 712 calculated by the address differ- 
ence calculating means 711, and outputs the resulting 
object code 308. 

[0180] The operation of the linker 307 is explained be- 
low using an example where the relocatable codes 306 
shown in Fig. 19 that have been outputted by the as- 
sembler 305 have been inputted. 
[0181] Fig. 21 is a flowchart showing the operation of 



the linker 307. 

[0182] First, the code combining means 702 com- 
bines a plurality of relocatable codes 701 . Fig. 23 shows 
the state when the relocatable codes 814 shown in Fig. 

5 19 have been combined with the relocatable code 
shown in Fig. 22. The code combining means 702 com- 
bines these relocatable codes with the packet address 
of the first relocatable code in Fig. 22 as "29'hOOOOOOOO" 
and the packet address of the first relocatable code in 

io Fig. 19 as "29'h00000001 " (step S2000, S2001). 

[0183] The addresses of all instructions are deter- 
mined in this way, so that the code combining means 
702 can resolve the addresses of labels that should be 
resolved by instruction addresses and then output the 

15 resulting combined codes 703. Fig. 23 shows that the 
address of label L2 in instruction 1810 "mov L2,r2" is the 
starting address of instruction packet 1815. This ad- 
dress has been set at "32'h 12345680", so that the code 
combining means 702 uses this value to replace the la- 

20 belL2. Fig. 24 shows the resulting combined codes 703. 
In instruction 1910 in Fig. 24, the label L2 has been re- 
placed with this address "32'h1 2345680" (step S2002). 
[0184] Next, the relocation information detecting 
means 704 finds external labels in the combined codes 

25 703 that should be resolved by PC relative values and 
extracts the addresses of the instructions that include 
these labels and the addresses of the instructions where 
these labels are located, which is to say, the addresses 
of branch instructions and branch destination instruc- 

30 tions. After this, the relocation information detecting 
means 704 outputs relocation information 705 that is 
composed of information showing the instructions in- 
cluding labels and the values to be used to resolve these 
labels. Fig. 25 shows the label information that is gen- 

35 erated from the combined codes 703 shown in Fig. 24. 
Here, label f is found as an external label that should be 
resolved by a PC relative value, so that "32'hOOOOOOOa" 
is obtained as the address of a branch instruction and 
"32'hOOOOOOOO" as the address of the branch destina- 

40 tion instruction (steps S2003, S2004). 

[0185] The lower address subtraction means 706 
then calculates the lower bits of the value f that is a PC 
relative value. The lower address subtraction means 
706 subtracts the lower 3 bits "3'b010" of the address 

45 "32'hOOOOOOOa" of the branch instruction 1 906 from the 
lower 3 bits "3'bOOO" of the address "32'hOOOOOOOO" of 
the branch destination instruction 1 901 . As a result, "1 " 
is obtained as the carry value 707 and "3'b100" is ob- 
tained as the lower subtraction result 708 (step S2005). 

so [0186] Next, the upper address subtraction means 
709 calculates the upper bits of the value f that is a PC 
relative value. The upper address subtraction means 
709 subtracts the upper 29 bits "29'h00000002" of the 
address "32'hOOOOOOOa" of the branch instruction 1906 

55 and the carry value 707 "1" from the upper 29 bits 
"29'hOOOOOOOO" of the address of the branch destination 
instruction 1901. As a result, "29'h 1 ffffff e" is obtained as 
the upper subtraction result 710 (step S2006). 
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[0187] The address difference calculating means 711 
finds the address difference 712, which is to say the PC 
relative value, by setting the lower subtraction result 708 
as the lower bits and the upper subtraction result 710 
as the upper bits. In this example, the address difference 
calculating means 811 sets "3'b100" as the lower bits 
and "29'h1ffffffe" as the upper bits, giving an address 
difference of "32'hfffffff4" (step S2007). 
[0188] Next, the relocation information resolving 
means 71 3 converts a label in the combined codes 703 
into a PC relative value, setting the lower 1 3 bits of the 
address difference 712 as the PC relative value if this 
address difference 712 can be expressed by the lower 
13 bits, or otherwise setting the entire address differ- 
ence 712 as the PC relative value. The address differ- 
ence that resolves the label f in the relocation informa- 
tion in Fig. 24 is "32'hfffffff4", which can be expressed 
by the lower 13-bit value "13'h1ff4", so that the label f in 
the combined codes 703 shown in Fig. 23 is converted 
into this lower 13-bit value to produce the object code. 
The resulting object code is shown in Fig. 26. In instruc- 
tion 2106 in Fig. 26, the label f has been converted into 
the lower 13-bit value "13'h1ff4" (steps S2008, S2009, 
S2010). 

[0189] As described above, the present linker finds 
PC relative values using an address calculation includ- 
ing a carry, and so is suited to a processor that uses a 
carry. 

Specific Operation of the Processor 

[0190] The following describes the operation of the 
processor when the object code shown in Fig. 26 has 
been stored in the instruction memory 407. 
[0191] At the start of execution of this object code, the 
upper PC 403 is set at "29'hOOOOOOOO" and the lower 
PC 404 is set at "3'b000". The prefetch upper counter 
410 receives an input from the upper PC 403 and so is 
set at "29'hOOOOOOOO". 

[0192] The read of instructions from the instruction 
memory 407 is performed in packet units according to 
the value in the prefetch upper counter 410. In detail, 
instruction packet 21 00 that is indicated by the prefetch 
upper counter 41 0 is read from the instruction sequence 
stored in the instruction memory 407 and is stored in the 
instruction buffer 408. The value of the prefetch upper 
counter 410 is incremented by one in each cycle, and 
so here becomes "29'h00000001 ". Hereafter, an in- 
struction packet indicated by the prefetch upper counter 
41 0 is read from the instruction memory 407 and written 
into the instruction buffer 408 in each cycle. 
[0193] The following explains the operations for de- 
coding and executing instructions for the case when in- 
struction packet 2104 is indicated by the upper PC 403 
and instruction 2107 in instruction packet 2104 is indi- 
cated by the lower PC 404. The instructions stored in 
the instruction buffer 408 are interpreted by the instruc- 
tion decoders 409a ~ 409c. The first instruction decoder 



409a receives an input of the first unit, unit 2107, in the 
instruction packet 2104 and investigates whether unit 
2107 is a one-unit instruction and whether there is a par- 
allel execution boundary. Since unit 2107 is a one-unit 

5 instruction and there is no parallel execution boundary, 
the second instruction decoder 409b receives an input 
of the next unit, unit 21 09, and investigates whether unit 
21 09 is a one-unit instruction and whether there is a par- 
allel execution boundary. Since unit 2109 is a one-unit 

10 instruction and there is no parallel execution boundary, 
the third instruction decoder 409c receives an input of 
the next unit and investigates whether this next unit is a 
one-unit instruction and whether there is a parallel exe- 
cution boundary. Since this unit is not a one-unit instruc- 

15 tion, the third instruction decoder 409c also receives an 
input of the following unit. The third instruction decoder 
409c then finds that this following unit includes a parallel 
execution boundary As a result, the instructions 2107, 
2109, and 2110 are executed in parallel. 

20 [0194] The first instruction decoder 409a decodes the 
instruction "add r0,r4" and outputs control signals to the 
first calculator 401a. The first calculator 401a adds the 
values of registers rO and r4 and stores the result in reg- 
ister r4. The second instruction decoder 409b decodes 

25 the instruction "and r1 ,r3" and outputs control signals to 
the second calculator 401 b. The second calculator 401 b 
performs a logical operation on the values of registers 
r1 and r3, and stores the result in register r3. The third 
instruction decoder 409c decodes the instruction "mov 

30 32'h12345680,r2" and so has the immediate 
"32'h 12345680" transferred into register r2. 
[0195] In this case, the instruction decoders 409a ~ 
409c inform the INC 412 that a total of four instruction 
units have been executed. The INC 412 increments the 

35 values in upper PC 403 and the lower PC 404 by four 
units. As a result, the lower PC 404 becomes "3'bOOO", 
a carry of two to the upper PC 403 is generated, and the 
upper PC 403 becomes "29'h00000003". This means 
that the first instruction to be executed in the next cycle 

40 is instruction 2112. 

[0196] The first instruction decoder 409a receives an 
input of the first unit, unit 21 12, and investigates whether 
unit 2112 is a one-unit instruction and whether there is 
a parallel execution boundary. Since unit 2112 is aone- 

45 unit instruction and there is no parallel execution bound- 
ary, the second instruction decoder 409b receives an 
input of the next unit, unit 211 3, and investigates wheth- 
er unit 2113 is a one-unit instruction and whether there 
is a parallel execution boundary. Here, the second in- 

so struction decoder 409b finds that unit 21 09 is a one-unit 
instruction and that there is a parallel execution bound- 
ary. As a result, the processor 309 finds that instructions 
2112 and 2113 can be executed in parallel. 
[0197] The first instruction decoder 409a decodes the 

55 instruction "1d (r2),r0", has the operand data, which has 
the value in register r2 as the operand address, read 
from the data memory 406 and stored in register rO. The 
second instruction decoder 409b decodes the instruc- 
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tion "bra 1 3'hlfec", and, since this is a branch instruction, 
updates the values in the upper PC 403 and lower PC 

404 using the address of the branch destination instruc- 
tion. 

[0198] First, the address indicated by the upper PC 
403 and lower PC 404 is amended. While a PC relative 
value shows the difference in addresses between a 
branch instruction and its branch destination instruction, 
the upper PC 403 and lower PC 404 show the address 
of the first address to be executed in the same cycle as 
the branch instruction, so that the upper PC 403 and 
lower PC 404 are amended so that they indicate the ad- 
dress of the branch instruction. In detail, the INC 412 
increments the values of the upper PC 403 and lower 
PC 404 by one unit to show that the branch instruction 
21 1 3 is preceded by one instruction unit, the first instruc- 
tion 2112 As a result, the lower PC 404 becomes 
"3'b01 0" and the upper PC 403 stays at "29'h00000003". 
[0199] Following this, the upper PC calculator 411 and 
the lower PC calculator 405 add the PC relative value 
"1 3'hlfec" obtained by the second instruction decoder 
409b to the upper PC 403 and the lower PC 404. Here, 
the sign-extended 32-bit value "32'hffffffec" is used as 
the PC relative value. This addition is split into additions 
of the upper 29 bits and the lower 3 bits. 
[0200] The lower PC calculator 405 adds the lower 3 
bits "3'b100" of the PC relative value to the value 
"3'b010" of the lower PC 404. As a result, a carry of one 
and the lower calculation result "3'b000" are obtained 
The lower PC calculator 405 sends the carry to the up- 
per PC calculator 41 1 , and sends the lower calculation 
result to the lower PC 404. 

[0201] Next, the upper PC calculator 411 adds the up- 
per 29 bits "29'hlffffffd" of the PC relative value and the 
carry value "1" received from the lower PC calculator 

405 to the value "29'h00000003" of the upper PC 403. 
The upper PC calculator 411 sends the upper calcula- 
tion result of "29'h00000001 " to the upper PC 403, which 
sends the value on to the prefetch upper counter 410. 
As a result of this processing, the prefetch upper counter 
410 is set at "29'h00000001 ", so that the next instruction 
packet to be prefetched will be instruction packet 2104. 
Also, since the upper PC 403 is "29'h00000001 " and the 
lower PC 404 is "3'bOOO", the first instruction to be exe- 
cuted in the next cycle is instruction 2105. 

[0202] Hereafter, codes in the object code are succes- 
sively read and executed in the same way, so that no 
explanation will be given for the other instructions. 
[0203] This completes the detailed explanation of the 
constructions of the processor 309, linker 307, assem- 
bler 305 and optimization apparatus 303 shown in Fig. 
5. A conventional compiler can be used as the compiler 
301 , so that no explanation of such will be given. 
[0204] Note that while the processor of this embodi- 
ment includes three instruction decoders 409a ~ 409c 
and three calculators 401a ~ 401c, the present inven- 
tion is not limited to this construction, so that only one 
instruction decoder and one calculator may by provided. 



It is also possible for the functions of the optimization 
apparatus 303 to be incorporated into the compiler 301 , 
and to have the object code 308 generated from the 
source code 300 by the compiler 301 , the assembler 

5 305, and the linker 307. 

[0205] In the present embodiment, the prefetch lower 
counter 41 3 was described as having the fixed value of 
"3'b000", thouqh this need not be the case. As one ex- 
ample, this value may be incremented by one in each 

10 cycle. This results in one byte of data being read from 
the instruction memory 407 and written into the instruc- 
tion buffer 408 in each cycle. 

Second Embodiment 

15 

[0206] The second embodiment of the present inven- 
tion relates to a modification of the processor, optimiza- 
tion apparatus, assembler, and linker of the first embod- 
iment. This modification uses a different value as the PC 

20 relative value for resolving labels in branch instructions. 
[0207] In the first embodiment, the PC relative value 
in a branch instruction is a difference in addresses be- 
tween the branch instruction and the branch destination 
instruction, while in this second embodiment, the PC rel- 

25 ative value in a branch instruction is a difference be- 
tween the address of the branch destination instruction 
and the address of the first instruction in same set of 
instructions as the branch instruction. 
[0208] In this way, the PC relative value has a slightly 

30 different meaning than in the present embodiment. 
However, if the devices used to generate a program (i. 
e., the optimization apparatus 303, assembler 305, and 
linker 307 that calculate the PC relative value) use the 
same meaning as the device that executes the program 

35 (i.e., a processor that calculates an address based on 
the PC relative value), the processor will be able to cor- 
rectly change the program counter to the address of a 
branch destination instruction when executing a branch 
instruction. 

40 [0209] The following explains the optimization appa- 
ratus 303, assembler 305, linker 307, and processor. 
[0210] The label detecting means 905 of the optimi- 
zation apparatus 303 generates the label information 
906 for labels that should be resolved by PC relative val- 

45 ues in the following way. Instead of generating label in- 
formation after obtaining the provisional addresses of 
the branch instruction and the branch destination in- 
struction in the same way as in the first embodiment, the 
label detecting means 905 generates the label informa- 

so tion 906 after obtaining the provisional addresses of the 
branch destination instruction and the address of the 
first instruction in the same set of instructions as the 
branch instruction. In the same way as in the first em- 
bodiment, this label information 906 is then used to cal- 

55 culate the address difference 913 that is the difference 
between two provisional addresses and is used in the 
optimized code 304. The assembler and linker also op- 
erate in this way. 
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[0211] The following describes a specific example of 
the object code 308 generated in this embodiment. 
[0212] The assembler 305 replaces the label L1 in in- 
struction 1409 in the machine language codes shown in 
Fig. 17 with the subtraction value "13h'1ff0" produced 
by subtracting the address "32'h00000010" of instruc- 
tion 1408, which is the first instruction in same set of 
instructions as instruction 1409, from the address 
"32'hOOOOOOOO" of the branch destination instruction. In 
the same way, the linker 307 replaces the label f in in- 
struction 1 906 in the combined codes shown in Fig. 24 
with the subtraction value "13'h1ff8" produced by sub- 
tracting the address "32'h00000008" of the instruction 
1 907, which is the first instruction in same set of instruc- 
tions as instruction 1906, from the address 
"32'hOOOOOOOO" of the branch destination instruction. 
Fig. 27 shows that the PC relative value of instruction 
221 3 differs from that shown in Fig. 26. 
[0213] The following describes the processor of the 
present embodiment. 

[0214] The processor 309 executes object code that 
have been generated as described above. When the 
processor 309 executes a branch instruction, the PC rel- 
ative value in the branch instruction is a difference in 
addresses between the branch destination instruction 
and the first instruction in same set of instructions as the 
branch instruction. Accordingly, the processor 309 does 
not amend the values of the upper PC 403 and lower 
PC 404, and, in the same way as in the first embodiment, 
adds the PC relative value to the values in the upper PC 
403 and lower PC 404 and updates the values in the 
upper PC 403 and lower PC 404 using the addition re- 
sults. When Lhis processor 309 executes the object 
code shown in Fig. 27, the execution of instruction 221 3 
results in the PC relative value "13h1ff8" being added 
to the present PC "32'h00000008", resulting in the PC 
being updated to "32'hOOOOOOOO". 
[0215] As described above, the processor of the 
present embodiment does not need to amend the value 
of the program counter in the same way as in the first 
embodiment whenever a branch instruction is executed. 
The address of a branch destination instruction can in- 
stead be obtained by directly adding a PC relative value 
to the PC. This reduces the total execution time. 

Third Embodiment 

[0216] The third embodiment of the present invention 
relates to a processor that can indicate the execution 
position of an instruction by fully utilizing the lower 3 bits 
of instruction addresses. 

[0217] In the first embodiment, the lower 3 bits of the 
instruction address are used to indicate a position that 
is one of three units. In the present embodiment, how- 
ever, full use is made of these 3 bits by having them 
indicate one of eight units. 

[0218] Fig. 28A shows the construction of an instruc- 
tion packet in the present embodiment. This instruction 



packet is composed of eight instruction units. Each in- 
struction unit in an instruction packet is 8 bits long, so 
that the total length of one instruction packet is 64 bits. 
The processor in this embodiment reads one instruction 

5 packet (64 bits) in one cycle. 

[0219] Fig. 28B shows the types of instructions used 
in this embodiment. Each instruction is composed of 
8-bit instruction units, with there being one-, two-, three-, 
four-, five-, and six-unit instructions. 

to [0220] Fig. 28C shows the relation between in-packet 
addresses and the instruction units in a packet. In the 
same way as in the first embodiment, a position in an 
instruction packet is indicated by the lower 3 bits of an 
instruction address. As shown in Fig. 28C, the in-packet 

15 address "3'bOOO" indicates the first unit, the in-packet 
address "3'b001 " indicates the second unit, the in-pack- 
et address "3'b010" indicates the third unit, the in-packet 
address "3'b011" indicates the fourth unit, the in-packet 
address "3'b100" indicates the fifth unit, the in-packet 

20 address "3'b101 " indicates the sixth unit, the in-packet 
address "3'b1 10" indicates the seventh unit, and the in- 
packet address "3'blll" indicates the eighth unit. 
[0221] As described above, the processor of the 
present embodiment indicates the execution position of 

25 an instruction making full use of the lower 3 bits of the 
instruction address. As a result, instructions can be ex- 
ecuted with a greater variation of execution units for one 
cycle. 

30 Fourth Embodiment 

[0222] The fourth embodiment of the present inven- 
tion relates to a method for calculating instruction ad- 
dresses without using a carry. 

35 [0223] The first embodiment teaches a processor for 
executing a program, and an optimization apparatus, 
assembler, and linker for generating a suitable program. 
All of these devices use a common method for calculat- 
ing an instruction address using a carry. This has the 

40 effect that the processor can correctly generate the ad- 
dress of a branch destination instruction using a PC rel- 
ative value. However, this effect can be achieved if the 
processor, optimization apparatus, assembler, and link- 
er use a common address calculation method that does 

45 not use a carry. This present embodiment relates to 
such a calculation method that calculates addresses 
without using a carry. 

[0224] This calculation method that does not use a 
carry resembles the calculation method in the first em- 

so bodiment in that the calculation of address is performed 
separately for the upper 29 bits and lower 3 bits. How- 
ever, the present method differs by not using a carry. 
[0225] The following explains the method by which 
the processor finds the address of a branch destination 

55 instruction by adding the address of a branch instruction 
and a PC relative value. The lower PC calculator 405 
shown in Fig 6 adds the lower 3 bits of the address of 
the branch instruction and the lower 3 bits of the PC rel- 
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ative value. Fig. 29A is an addition table showing the 
addition rules for adding the lower 3 bits of the address 
of the branch instruction and the lower 3 bits of the PC 
relative value in the present calculation method. As 
shown in the figure, this calculation differs from a normal 
addition of binary values in that it cycles between the 
three states "3'bOOO", "3'b010", and "3'b100". Note that 
no carry is generated. 

[0226] The upper PC calculator 411 shown in Fig. 6 
adds the upper 29 bits of the address of the branch in- 
struction and the upper 29 bits of the PC relative value. 
This is a normal addition of binary values. 
[0227] The results of the above additions form the ad- 
dress of a branch destination instruction. In detail, the 
addition result for the lower 3 bits is set in the lower PC 
404 and the addition result for the upper 29 bits is set in 
the upper PC 403. 

[0228] The following explains the method used by the 
optimization apparatus, assembler, and linker to calcu- 
late the PC relative value, which is to say, to subtract 
the address of the branch destination instruction from 
the address of the branch instruction. This subtraction 
is split into an upper 29 bits and lower 3 bits like the 
addition performed by the processor. The lower address 
subtraction means 907 of the optimization apparatus 
303, the lower address subtraction means 806 of the 
assembler 305, and the lower address subtraction 
means 706 of the linker 307 subtract the lower 3 bits of 
the address of a branch instruction from the lower 3 bits 
of the address of the branch destination instruction. Fig. 
29B is a subtraction table showing the subtraction rules 
for subtracting the lower 3 bits of the address of the 
branch instruction from the lower 3 bits of the address 
of the branch destination instruction. As shown in the 
figure, this calculation differs from a normal subtraction 
of binary values in that it cycles between the three states 
"3'bOOO", "3'b010", and "3'b1 00". Note that no carry is 
generated. 

[0229] The upper address subtraction means 910 of 
the optimization apparatus 303, the upper address sub- 
traction means 809 of the assembler 305, and the upper 
address subtraction means 709 of the linker 307 sub- 
tract the upper 29 bits of the address of the branch in- 
struction from the upper 29 bits of the address of the 
branch destination instruction. This is a normal subtrac- 
tion of binary values. 

[0230] The PC relative value is then found by setting 
the result of the above subtraction for the lower 3 bits 
as the lower 3 bits and the result of the above subtraction 
for the upper 29 bits as the upper 29 bits. 
[0231] Fig. 30 shows the object code that is generated 
by the above address calculation method of the present 
embodiment that does not use a carry. The PC relative 
values of instructions 2406 and 2413 differ to those in 
Fig. 26. The following explains the calculation of the PC 
relative value of instruction 2406. 
[0232] The lower address subtraction means 706 
subtracts the lower 3 bits "3'b010" of the address of in- 



struction 2406 from the lower 3 bits "3'bOOO" of the ad- 
dress of instruction 2401 in accordance with the sub- 
traction table shown in Fig. 29B. This produces the lower 
subtraction result "3'b100". 

5 [0233] The upper address subtraction means 709 
subtracts the upper 29 bits "29'h00000001 " of the ad- 
dress of instruction 2406 from the upper 29 bits 
"29'hOOOOOOOO" of the address of instruction 2401 . This 
produces the upper subtraction result "29'h1fffffff". 

w [0234] The address difference calculating means 711 
generates the address difference "32'h1ffffffc" by setting 
the upper subtraction result "29'h1fffffff" as the upper 29 
bits and the lower subtraction result "3'b100" as the low- 
er 3 bits. 

15 [0235] The relocation information resolving means 
713 judges that the address difference "32'h 1 ffffffc" can 
be expressed by just the lower 1 3 bits "13'h1ffc" and so 
replaces a label with this value "13'h1ffc" as a PC rela- 
tive value to generate instruction 2406. 

20 [0236] The processor 309 executes the object code 
generated as described above. When executing a 
branch instruction, the processor 309 adds the upper 
PC 403 and lower PC 404, which have been amended 
to correctly indicate the branch instruction, to the PC rel- 

25 ative value in the branch instruction without generating 
a carry. 

[0237] When the processor 309 executes instruction 
2406 in the object code shown in Fig. 30, the lower PC 
calculator 405 adds the amended lower PC 404 "3'b01 0" 

30 and the lower 3 bits "3'b100" of the PC relative value 
and updates the lower PC 404 to the resulting addition 
value "3'bOOO". The upper PC calculator 411 adds the 
amended upper PC 403 "29'h00000001 " and the upper 
29 bits "29'h1fffffff" of the PC relative value and updates 

35 the lower PC 404 to the resulting addition value 
"29'h00000000". 

[023B] As described above, the present calculation 
method can calculate addresses without a carry being 
sent between the lower PC calculator 405 and the upper 
40 pc calculator 41 1 . This means that address calculation 
can be performed with a simpler hardware construction. 

Fifth Embodiment 

45 [0239] The fifth embodiment of the present invention 
teaches a method for calculating instruction addresses 
using absolute values. 

[0240] This calculation method that uses absolute val- 
ues resembles the calculation method in the first em- 

so bodiment in that the calculation of address is performed 
separately for the upper 29 bits and lower 3 bits. How- 
ever, the present method differs from the carry method 
in that the value of the lower 3 bits of an instruction ad- 
dress are set as the lower 3 bits of the calculation result. 

55 [0241] The following explains the method by which 
the processor finds the address of a branch destination 
instruction by adding the address of a branch instruction 
and a PC relative value. The lower PC calculator 405 
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shown in Fig. 6 adds the lower 3 bits of the address of 
the branch instruction and the lower 3 bits of the PC rel- 
ative value. Fig. 31 A is an addition table showing the 
addition rules for adding the lower 3 bits of the address 
of the branch instruction and the lower 3 bits of the PC 
relative value in the present calculation method that us- 
es absolute values. As shown in the figure, the lower 3 
bits of the PC relative value are set as the lower 3 bits 
of the addition result. 

[0242] The upper PC calculator 411 shown in Fig. 6 
adds the upper 29 bits of the address of the branch in- 
struction and the upper 29 bits of the PC relative value. 
This is a normal addition of binary values. 
[0243] The results of the above additions form the ad- 
dress of a branch destination instruction. In detail, the 
addition result for the lower 3 bits is set in the lower PC 
404 and the addition result for the upper 29 bits is set in 
the upper PC 403. 

[0244] The following explains the method used by the 
optimization apparatus 303, assembler 305, and linker 
307 to calculate the PC relative value, which is to say, 
to subtract the address of the branch destination instruc- 
tion from the address of the branch instruction. This sub- 
traction is split into an upper 29 bits and lower 3 bits, 
like the addition performed by the processor. The lower 
address subtraction means 907 of the optimization ap- 
paratus 303, the lower address subtraction means 806 
of the assembler 305, and the lower address subtraction 
means 706 of the linker 307 subtract the lower 3 bits of 
the address of a branch instruction from the lower 3 bits 
of the address of the branch destination instruction. Fig. 
31 B is a subtraction table showing the subtraction rules 
for subtracting the lower 3 bits of the address of the 
branch instruction from the lower 3 bits of the address 
of the branch destination instruction in this calculation 
method that uses absolute values. As shown in the fig- 
ure, the lower 3 bits of the branch destination address 
are set as the subtraction result for the lower 3 bits. 
[0245] The upper address subtraction means 910 of 
the optimization apparatus 303, the upper address sub- 
traction means 809 of the assembler 305, and the upper 
address subtraction means 709 of the linker 307 sub- 
tract the upper 29 bits of the address of the branch in- 
struction from the upper 29 bits of the address of the 
branch destination instruction. This is a normal subtrac- 
tion of binary values. 

[0246] The PC relative value is then found by setting 
the result of the above subtraction for the lower 3 bits 
as the lower 3 bits and the result of the above subtraction 
for the upper 29 bits as the upper 29 bits. 
[0247] Fig. 32 shows the object code that is generated 
by the above address calculation method of the present 
embodiment that uses absolute values. The PC relative 
values of instructions 2606 and 2613 differ to those in 
Fig. 26. The following explains the calculation of the PC 
relative value of instruction 2606. 
[0248] The lower address subtraction means 706 
subtracts the lower 3 bits "3'b010" of the address of in- 
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struction 2406 from the lower 3 bits "3'bOOO" of the ad- 
dress of instruction 2401 in accordance with the sub- 
traction table shown in Fig. 31 B. This produces the lower 
subtraction result "3'b000". 

5 [0249] The upper address subtraction means 709 
subtracts the upper 29 bits "29'h00000001 " of the ad- 
dress of instruction 2406 from the upper 29 bits 
"29'hOOOOOOOO" of the address of instruction 2401 . This 
produces the upper subtraction result "29'h1fffffff". 

w [0250] The address difference calculating means 711 
generates the address difference "32'h 1 ffffff8" by setting 
the upper subtraction result "29'h1fffffff" as the upper 29 
bits and the lower subtraction result "3'b000" as the low- 
er 3 bits. 

15 [0251] The relocation information resolving means 
713 judges that the address difference "32'h1ffffff8" can 
be expressed by just the lower 13 bits "13'h1ff8" and so 
replaces a label with this value "13'h1ff8" as a PC rela- 
tive value to generate instruction 2606. 

20 [0252] The processor 309 executes the object code 
generated as described above. When executing a 
branch instruction, the processor 309 adds the upper 
PC 403 and lower PC 404, which have been amended 
to correctly indicate the branch instruction, to the PC rel- 

25 ative value in the branch instruction using the present 
absolute value method. 

[0253] When the processor 309 executes instruction 
2606 in the object code shown in Fig. 32, the lower PC 
calculator 405 adds the amended lower PC 404 "3'b01 0" 

30 and the lower 3 bits "3'b000" of the PC relative value 
and updates the lower PC 404 to the resulting addition 
value "3'bOOO". The upper PC calculator 411 adds the 
amended upper PC 403 "29'h00000001 " and the upper 
29 bits "29'h1fffffff" of the PC relative value and updates 

35 the lower PC 404 to the resulting addition value 
"29'h00000000". 

[0254] As described above, the present calculation 
method can calculate addresses without needing to cal- 
culate the lower bits, so that the speed for calculating 
40 addresses can be improved. 

Sixth Embodiment 

[0255] The sixth embodiment of the present invention 
45 relatestoa linear calculation method for addresses. Un- 
like the other embodiments, this linear calculation meth- 
od calculates instruction addresses without splitting the 
calculation into an upper 29 bits and lower 3 bits. 
[0256] The following explains the present method for 
so finding the address of a branch destination instruction 
from the address of a branch instruction and a PC rela- 
tive value. While the processor that uses the carry meth- 
od is equipped with an upper PC calculator 411 for cal- 
culating the upper 29 bits and a lower PC calculator 405 
55 for calculating the lower 3 bits, a processor that uses the 
present linear calculation method is only equipped with 
one PC calculator for calculating a 32-bit address. The 
PC calculator in this linear calculation method adds a 
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32-bit address of a branch instruction and a 32-bit PC 
relative value. This calculation is a normal binary addi- 
tion. 

[0257] The addition result of the PC calculator is set 
as the address of the branch destination instruction. 
This means that the lower 3 bits of the addition result 
are set in the lower PC 404 and the upper 29 bits of the 
addition result are set in the upper PC 403. 
[0258] The following explains the calculation of the 
PC relative value by the optimization apparatus 303, as- 
sembler 305, and linker 307, which is to say, the sub- 
traction of the address of the branch instruction from the 
address of the branch destination instruction. Like the 
processor in this embodiment, the optimization appara- 
tus 303, assembler 305, and linker 307 are each provid- 
ed with only one calculator, the address subtraction 
means, for calculating a 32-bit address. The address 
subtraction means in this linear calculation method sub- 
tracts the address of a branch instruction from the ad- 
dress of a branch destination instruction. This calcula- 
tion is a normal binary subtraction. The subtraction re- 
sult is then set as the PC relative value. 
[0259] Fig. 33 shows the object code that has been 
generated using the linear calculation method of the 
present embodiment. In Fig. 33, the PC relative values 
in instructions 2706 and 2713 differ to those shown in 
Fig. 26. The following describes the method for calcu- 
lating the PC relative value for instruction 2706. 
[0260] The address subtraction means in the linear 
calculation method subtracts the 32-bit address 
"32'h00000000" of instruction 2701 from the 32-bit ad- 
dress "32'hOOOOOOOa" of instruction 2706 and so ob- 
tains the address difference "32'hfffffff6". 
[0261] The relocation information resolving means 
713 judges that the address difference "32'hfffffff6" can 
be expressed by just its lower 13 bits "13'h1ff6", and so 
replaces the label with "13'h1ff6" as the PC relative val- 
ue to generate instruction 2706. 
[0262] The processor 309 executes the object code 
generated as described above. When executing a 
branch instruction, the processor 309 adds the upper 
PC 403 and lower PC 404 that have been amended to 
indicate the address of the branch instruction to the PC 
relative value using the present linear calculation meth- 
od. 

[0263] When the processor 309 executes instruction 
2706 in the object code shown in Fig. 33, the PC calcu- 
lator in this embodiment adds a 32-bit PC value 
"32'hOOOOOOOa", which has the amended value of the 
upper PC 403 as the upper 29 bits and the amended 
value of the lower PC 404 as the lower 3 bits, to the PC 
relative value "32'hfffffff6" and so obtains the addition 
result "32'hOOOOOOOO". After this, the PC calculator up- 
dates the lower PC 404 to the lower 3 bits "3'bOOO" of 
this addition value, and the upper PC 403 to the upper 
29 bits "29'hOOOOOOOO" of this addition value. 
[0264] In this way, the present linear calculation meth- 
od can calculate addresses using a standard calculator 



as the PC calculator. This simplifies the structure of the 
processor. 

Seventh Embodiment 

5 

[0265] The seventh embodiment of the present inven- 
tion relates to a processor that interprets and executes 
PC adding instructions and PC subtracting instructions 
and to a compiler that generates such instructions. 

w [0266] Fig. 34 shows the processor of the present em- 
bodiment. The processor of the present embodiment dif- 
fers from the processor in the first embodiment in that it 
further includes a second lower PC calculator 2800 and 
a second upper PC calculator 2802 and in that the first 

15 instruction decoder 2801a, the second instruction de- 
coder 2801b, and the third instruction decoder 2801c 
are all provided with new functions. 
[0267] The instruction decoders 2801a ~ 2801c are 
provided with an extra function for decoding PC adding 

20 instructions and PC subtracting instructions. Fig. 35A 
shows the operation that corresponds to a PC adding 
instruction which is shown in mnemonic form. As shown 
in Fig. 35A, a PC adding instruction adds a PC relative 
value "disp" to the value of the PC that is stored in a 

25 register and stores the addition result in the same re- 
qister. Fig. 35B shows the operation that corresponds 
to a PC subtracting instruction which is shown in mne- 
monic form. As shown in Fig. 35B, a PC adding instruc- 
tion subtracts a PC relative value "disp" from the value 

30 of the PC that is stored in a register and stores the sub- 
traction result in the same register. 
[0268] The second lower PC calculator 2800 and the 
second upper PC calculator 2802 perform the PC add- 
ing instruction and PC subtraction instruction described 

35 above, using the same calculation rules as the lower PC 
calculator 405 and the upper PC calculator 411 de- 
scribed in the first embodiment. 
[0269] Fig. 36 shows the construction of the compiler 
of the present embodiment. 

40 [0270] The source code 2901 is a program written in 
a high-level language such as C. 
[0271] The intermediate code converting unit 2902 
converts the source code 2901 into intermediate code 
2903 which is an internal expression for the compiler. 

45 This intermediate code converting unit 2902 is a well- 
known technology and so will not be described. 
[0272] The PC value adding instruction converting 
unit 2904 converts each intermediate code in the inter- 
mediate code 2903 that adds a value of the PC and a 

so variable into an assembler code 2906 for a PC adding 
instruction that is shown in Fig. 34. 
[0273] The instruction converting unit 2905 converts 
the other intermediate codes into assembler code 2906. 
This instruction converting unit 2905 is a well known 

55 technology and so will not be described. 

[0274] The following describes a specific example of 
the operation of the present compiler. Fig. 37 is a flow- 
chart showing the operation of this compiler. 
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[0275] First, the compiler receives an input of source 
code. Fig. 38 shows source code which is written in C 
language. In Fig. 38, the external functions g1, g2, g3, 
and g4 are declared, and the function f is defined as a 
function that receives the int-type variable "i". This func- 
tion f includes code that substitutes the address of func- 
tion g1 into the pointer fp if the value of "i" is 1 , substi- 
tutes the address of function g2 into the pointer fp if the 
value of "i" is 2, substitutes the address of function g3 
into the pointer fp if the value of "i" is 3, substitutes the 
address of function g4 into the pointer fp if the value of 
"i" is 4, and finally calls the function indicated by the 
pointer fp (step 3600). 

[0276] Next, the intermediate code converting unit 
2902 converts the source code into intermediate codes. 
When doing so, the intermediate code converting unit 
2902 coverts (a) a source code that substitutes a pointer 
to an external function into a pointer variable into (b) an 
intermediate code that adds the difference between the 
address of the start of present function and the address 
of the start of the external function to a temporary vari- 
able that stores the address of the start of the present 
function, and substitutes the addition result into the 
pointer variable. 

[0277] Fig. 39 shows the intermediate codes that 
have been generated from the source program shown 
in Fig. 38. The intermediate code 3201 shown in Fig. 39 
is an intermediate code that has the label f marking the 
start of the function and that substitutes the present val- 
ue of the PC, which is to say, the first address of function 
f, into the temporary variable tmp. The intermediate 
code 3202 is intermediate code that judges whether the 
value of variable i is not "1 ". The intermediate code 3203 
is an intermediate code that branches to the label L 
when the judgement by intermediate code 3203 is true, 
that is, variable i is not "1 ". The intermediate code 3204 
is executed when variable i is "1 ", and adds a difference, 
obtained by subtracting a first address of function f from 
the first address of function g1 , to the temporary variable 
tmp into which the first address of function f has been 
substituted, and has the addition result substituted into 
the variable fp. The intermediate code 3205 is an inter- 
mediate code that branches to the label L. 
[0278] The intermediate code 3206 includes the label 
LI, and is an intermediate code that judges whether var- 
iable i is not equal to "2". The intermediate code 3207 
branches to label L2 when the judgement in intermedi- 
ate code 3206 is true, which is to say, when variable i is 
not "2". The intermediate code 3208 is executed when 
variable i is equal to "2", and is an intermediate code 
that adds a difference, obtained by subtracting afirst ad- 
dress of function f from the first address of function g2, 
to the temporary variable tmp into which the first address 
of function f has been substituted, and has the addition 
result substituted into the variable fp. The intermediate 
code 3209 is an intermediate code that branches to the 
label L. 

[0279] The intermediate code 321 0 includes the label 



12, and is an intermediate code that judges whether var- 
iable i is not equal to "3". The intermediate code 3211 
branches to label L3 when the judgement in intermedi- 
ate code 321 0 is true, which is to say when variable i is 

5 not "3". The intermediate code 3212 is executed when 
variable i is equal to "3", and is an intermediate code 
that adds a difference, obtained by subtracting a first ad- 
dress of function f from the first address of function g3, 
to the temporary variable tmp into which the first address 

io of function f has been substituted, and has the addition 
result substituted into the variable fp. The intermediate 
code 321 3 is an intermediate code that branches to the 
label L. 

[0280] The intermediate code 321 4 includes the label 
15 L4, and is an intermediate code that adds a difference, 
obtained by subtracting a first address of function f from 
the first address of function g4, to the temporary variable 
tmp into which the first address of function f has been 
substituted, and has the addition result substituted into 
20 the variable fp. The intermediate code 321 5 includes the 
label L and is an intermediate code that calls the function 
indicated by the variable fp. 

[0281] As described above, the intermediate codes in 
Fig. 39 do not simply substitute the absolute address of 

25 the function g1 , g2, g3 or g4 into the variable fp, but in- 
stead add a difference between the first address of func- 
tion f and the first address of one of the functions g1, 
g2, g3, and g4 to the first address of the function f and 
substitute the addition result into the variable fp (steps 

30 S3601 ~ S3603). 

[0282] Next, the PC value adding instruction convert- 
ing unit 2904 converts the intermediate codes into as- 
sembler code. The PC value adding instruction convert- 
ing unit 2904 searches for intermediate codes that add 

35 the value of the PC to a PC relative value and converts 
such codes into assembler code that uses the second 
lower PC calculator 2800 and the second upper PC cal- 
culator 2802. The instruction converting unit 2905 then 
converts the remaining intermediate codes into assem- 

40 bier code. 

[0283] The PC value adding instruction converting 
unit 2904 ascertains that the operand tmp in intermedi- 
ate code 3204 in Fig. 39 has been set at the value of the 
PC by the intermediate code 3201 and that the operator 

45 ■+" indicates an addition of the value of the PC and a 
PC relative value, and so converts intermediate code 
3204 into the assembler code addpc that performs an 
addition using the second lower PC calculator 2800 and 
the second upper PC calculator 2802. In the same way, 

so the PC value adding instruction converting unit 2904 
converts intermediate codes 3208, 3212, and 3214 into 
assembler codes addpc. The other intermediate codes 
in Fig. 39 are converted into assembler codes by the 
instruction converting unit 2905. 

55 [0284] Fig. 40 shows the assembler code that has 
been produced by converting the intermediate codes 
shown in Fig. 39. In Fig. 40, the assembler code 3301 
has the label f marking the start of a function and is an 
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instruction that transfers the value of the PC into register 
r1 . The assembler code 2802 is an instruction that judg- 
es whether the constant "1 " and the value of register rO 
are not equal. The assembler code 3303 is an instruc- 
tion that branches to label L1 when the judgement in 
assembler code 2802 is true. The assembler code 3304 
has the second lower PC calculator 2800 and the sec- 
ond upper PC calculator 2802 add the PC relative value 
that is the difference between the first address of func- 
tion g1 and the first address of function f to the value of 
the PC which is the first address of function f and is 
stored in the register r1 , and has the result transferred 
into register r1 . The assembler code 3305 is an instruc- 
tion that branches to the label L. 
[0285] The assembler code 3306 has the label L1 and 
is an instruction that judges whether the constant "2" 
and the value of register rO are not equal. The assembler 
code 3307 is an instruction that branches to label L2 
when the judgement in assembler code 3306 is true. 
The assembler code 3308 has the second lower PC cal- 
culator 2800 and the second upper PC calculator 2802 
add the PC relative value that is the difference between 
the first address of function g2 and the first address of 
function f to the value of the PC which is the first address 
of function f and is stored in the register r1 , and has the 
result transferred into register r1. The assembler code 
3309 is an instruction that branches to the label L. 
[0286] The assembler code 331 0 has the label L2 and 
is an instruction that judges whether the constant "3" 
and the value of register rO are not equal. The assembler 
code 3311 is an instruction that branches to label L3 
when the judgement in assembler code 3310 is true. 
The assembler code 331 1 has the second lower PC cal- 
culator 2800 and the second upper PC calculator 2802 
add the PC relative value that is the difference between 
the first address of function g3 and the first address of 
function f to the value of the PC which is the first address 
of function f and is stored in the register r1 , and has the 
result transferred into register r1. The assembler code 
331 3 is an instruction that branches to the label L. 
[0287] The assembler code 331 4 has the label L3 and 
is an instruction that has the second lower PC calculator 
2800 and the second upper PC calculator 2802 add the 
PC relative value that is the difference between the first 
address of function g4 and the first address of function 
f to the value of the PC which is the first address of func- 
tion f and is stored in the register r1 , and has the result 
transferred into register r1. The assembler code 3315 
has the label Land is an instruction that calls the function 
indicated by register r1 . The assembler code 3316 is an 
instruction that ends the function. 
[0288] As described above, when there is a source 
code in function f that substitutes a pointer to the exter- 
nal function g into a pointer variable, the present com- 
piler does not generate an instruction (such as "mov 
r1 ,g") that transfers the address of the external function 
g into register r1, but instead generates an instruction 
(addpc g-f . r1 ) that has adds a difference (g-f) in ad- 



dresses between function f and function g to the address 
of function f that is stored in register r1 , and has the re- 
sult transferred into register r1. Since the value of the 
PC relative value g-f is smaller that the absolute address 

5 g, the overall code size of programs can be reduced by 
usinqsuch addpc instructions. This has a further benefit 
for PIC codes where the addresses of a program in 
memory are determined when the program is executed, 
since calculation instructions that use such PC relative 

10 values must be used. 

[0289] In the same way as in the first embodiment, the 
assembler code produced by the compiler of the present 
embodiment is converted into object code that can be 
executed by the processor by an optimization apparatus 

15 303, an assembler 305 and a linker 307. The processor 
executes the PC adding instruction "addpc 9-f,r1 " in the 
generated object code using the second lower PC cal- 
culator 2800 and the second upper PC calculator 2802. 
In detail, the second lower PC calculator 2800 adds the 

20 lower 3 bits of the constant "g-f" and the lower 3 bits of 
the value stored in register r1 and sends any carry that 
is generated to the second upper PC calculator 2802. 
The second upper PC calculator 2802 adds the upper 
29 bits of the constant "g-f", the upper 29 bits of the value 

25 stored in register r1 , and any carry it has received from 
the second lower PC calculator 2800. A value given by 
setting the addition result of the second lower PC cal- 
culator 2800 as the lower 3 bits and the addition result 
of the second upper PC calculator 2802 as the upper 29 

30 bits is then set in register r1 . 

[0290] Note that while the instructions shown in Fig. 
35A and 35B respectively are an addition and a subtrac- 
tion of a constant and the value in a register, this need 
not be the case. An addition and a subtraction of values 

35 in registers, or an addition and a subtraction of a value 
in a register and the PC may equally be used. 
[0291] The calculation method used by the second 
lower PC calculator 2800 and the second upper PC cal- 
culator 2802 also need not be the carry method used in 

40 the first embodiment. Provided the same method is used 
by the optimization apparatus 303, assembler 305, and 
linker 307 that generate the object code to be executed 
by the processor, any of a no-carry method, a linear 
method, and an absolute value method may be used. 

45 

Eighth Embodiment 

[0292] The eighth embodiment of the present inven- 
tion relates to a debugger and a disassembler. 
so [0293] Fig. 41 is a block diagram showing the con- 
struction of the debugger and disassembler of the 
present embodiment. 

[0294] The input control unit 4000 receives an input 
from the user and controls the other components ac- 
55 cording to this input. 

[0295] The packet address specifying unit 4001 cal- 
culates the upper 29 bits of the address of the inputted 
instruction. 
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[0296] The in-packet address specifying unit 4002 
calculates the lower 3 bits of the address of the inputted 
instruction. 

[0297] The instruction memory 4004 stores the in- 
structions to be processed by the debugger and disas- 
sembler. As in the first embodiment, the addresses of 
instructions are 32 bits in length and are composed of 
a packet address as the upper 29 bits and an in-packet 
address as the lower 3 bits Fig. 41 shows how the in- 
structions shown in Fig. 25 are stored. 
[0298] The instruction reading unit 4003 reads an in- 
struction packet indicated by the packet address spec- 
ified by the packet address specifying unit 4001 from the 
instruction memory 4004. 

[0299] The instruction buffer 4005 stores the instruc- 
tion packet read from the instruction memory 4004 by 
the instruction reading unit 4003. 
[0300] The instruction decoding unit 4006 extracts the 
instruction unit with the in-packet address specified by 
the in-packet address specifying unit 4002 from the in- 
struction buffer 4005 and decodes the extracted instruc- 
tion unit. When the instruction unit is a branch instruc- 
tion, the instruction decoding unit 4006 sends the PC 
relative value 4007 to the lower PC calculator 4008 and 
the upper PC calculator 4009. 

[0301] The label table 4011 is a table storing each la- 
bel name associated with a corresponding instruction 
address. This label table 4011 is generated by extracting 
information from the optimized code when the assem- 
bler described in the first embodiment generates ma- 
chine language codes. 

[0302] In Fig. 41 , the address "32'h00000000" corre- 
sponds to the label f, the address "32'h00000008" cor- 
responds to the label L1, and the address 
"32'h 12345680" corresponds to the label L2. 
[0303] The display unit 401 2 displays the results of a 
disassembling of an instruction. 
[0304] The instruction replacing unit 4013 writes the 
instruction that has been replaced into the instruction 
unit(s) in the instruction buffer 4005 that is/are indicated 
by the in-packet address specified by the in-packet ad- 
dress specifying unit 4002. 

[0305] The instruction writing unit 4014 rewrites the 
instruction packet in the instruction memory 4004 with 
the packet address specified by the packet address 
specifying unit 4001 using the amended instruction 
packet stored in the instruction buffer 4005. 
[0306] The upper PC calculator 4009 performs a cal- 
culation on the upper 29 bits of the instruction address 
specified by the packet address specifying unit 4001 
and the upper 29 bits of the PC relative value 4007. 
[0307] The lower PC calculator 4008 performs a cal- 
culation on the lower 3 bits of the instruction address 
specified by the in-packet address specifying unit 4002 
and the lower 3 bits of the PC relative value 4007. The 
calculation methods used by these PC calculators is the 
same as that used when generating the object code. 
[0308] The following describes a specific example of 



the operation of the present disassembler. Fig. 42 is a 
flowchart showing the operating procedure of this dis- 
assembler. 

[0309] First, the input control unit 4000 receives a 
5 command indicating the disassembling of an instruction 
and an input of the address of the instruction to be dis- 
assembled. In this specific example, the input control 
unit 4000 receives "32'h0000001a" as the instruction 
address (step S4100). 
w [0310] Next, the packet address specifying unit 4001 
specifies the packet address from the upper 29 bits of 
the instruction address. The instruction reading unit 
4003 then reads the instruction packet with the specified 
packet address from the instruction memory 4004 and 
15 stores it in the instruction buffer 4005. In this example, 
"29'h00000003" is specified as the packet address, and 
the instruction sequence "1d (r2),r0llbra 13'h1fcclladd 
r2,r3" is stored in the instruction buffer 4005 (step 
S4101). 

20 [0311] The in-packet address specifying unit 4002 
then specifies the in-packet address from the lower 3 
bits of the instruction address and informs the instruction 
decoding unit 4006 of the instruction unit that has the 
specified in-packet address. The instruction decoding 

25 unit 4006 then extracts the indicated instruction unit 
from the instruction buffer 4005. In this example, 
"3'b01 0" is specified as the in-packet address and the 
instruction "bra 1 3'hlfec" that is the second unit in the 
instruction buffer 4005 is inputted into the instruction de- 

30 coding unit 4006 (step S4102). 

[0312] The instruction decoding unit 4006 judges 
whether the inputted instruction is a branch instruction. 
In this example, the inputted instruction "bra 1 3'hlfec" is 
a branch instruction, so that this judgement is true (step 

35 S4103). 

[0313] When the instruction is a branch instruction, a 
calculation is performed on the PC relative value 4007 
indicated in the instruction and address of the inputted 
instruction. The lower PC calculator 4008 performs an 
40 addition or a subtraction on the in-packet address of the 
inputted instruction and on the lower 3 bits of the PC 
relative value 4007 and sends the calculation result to 
the label search unit 4010. The upper PC calculator 
4009 performs an addition or a subtraction on the packet 
45 address of the inputted instruction and on the upper 29 
bits of the PC relative value 4007 and sends the calcu- 
lation result to the label search unit 4010. The label 
search unit 4010 specifies the address of a label from 
the calculation result for the upper bits and the calcula- 
te tion result for the lower bits. In this example, the label 
address "32'h00000008" is specified by a calculation us- 
ing the address "32'h0000001a" of the inputted instruc- 
tion and the PC relative value 4007 "1 3'hlfec" (steps 
S4103, S4104). 
55 [0314] The label search unit 4010 then refers to the 
label table 4011 and finds the label name that has the 
specified address. In this example, the label L1 corre- 
sponds to the address "32'h00000008" (Step S4107). 
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[0315] The display unit 4012 displays the assembler 
name of the branch instruction and the label name found 
by the label search unit 4010. In this example, the dis- 
play unit 401 2 displays the assembler name "bra" of the 
branch instruction and the corresponding label name 
"Label L1" (Step S4108). 

[0316] The instruction decoding unit 4006 has the dis- 
play unit 4012 display only the assembler name when 
the extracted instruction is not an assembler instruction 
(Step S4109). 

[0317] The following describes a specific example of 
the operation of the present debugger. 
[0318] Fig. 43 is a flowchart showing the operation of 
the present debugger. 

[0319] First, the input control unit 4000 receives a 
command indicating the debugging of an instruction, the 
address of an instruction to be replaced, and the instruc- 
tion to be used to replace of this instruction. In this spe- 
cific example, the input control unit 4000 receives 
"32'h0000001 a" as the instruction address and the sub- 
traction instruction "sub r0,r1" as the replacement in- 
struction (step S4200). 

[0320] Next, the packet address specifying unit 4001 
specifies the packet address from the upper 29 bits of 
the instruction address. The instruction reading unit 
4003 then reads the instruction packet with the specified 
packet address from the instruction memory 4004 and 
stores it in the instruction buffer 4005. In this example, 
"29'h00000003" is specified as the packet address, and 
the instruction sequence "1d (r2),r0llbra 13'h1feclladd 
r2,r3" is stored in the instruction buffer 4005 (step 
S4201). 

[0321] The in-packet address specifying unit 4002 
then specifies the in-packet address from the lower 3 
bits of the instruction address. In this example, the in- 
packet address "3'b010" is specified (step S4202). 
[0322] If the specified in-packet address is "3'bOOO", 
the first unit in the instruction packet in the instruction 
buffer 4005 is replaced with the inputted replacement 
instruction. If the specified in-packet address is 
"3'b010", the second unit in the instruction packet in the 
instruction buffer 4005 is replaced with the inputted re- 
placement instruction. If the specified in-packet address 
is "3'b100", the third unit in the instruction packet in the 
instruction buffer 4005 is replaced with the inputted re- 
placement instruction. In this example, the specified in- 
packet address is "3'b010", so that the instruction "bra 
13'hlfec" in the second unit in the instruction packet in 
the instruction buffer 4005 is replaced with the inputted 
replacement instruction "sub r0,r1". As a result, the in- 
struction packet in the instruction buffer 4005 becomes 
"1d (r2),r0llsub rO, rllladd r2,r3" (steps S4203 ~ 
S4207). 

[0323] The instruction writing unit 4014 replaces the 
instruction packet at the indicated packet address in the 
instruction memory 4004 with the instruction packet 
stored in the instruction buffer 4005. In this example, the 
instruction packet "1d (r2),r0llbra 13'h1feclladd r2,r3" at 
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the packet address "29'h00000003" in the instruction 
memory 4004 is replaced with the instruction packet "Id 
(r2),r0llsub rO, rllladd r2,r3" in the instruction buffer 
4005. 

5 [0324] As described above, the disassembler of the 
present embodiment can disassemble instructions that 
are executable for the processor 309 of the first embod- 
iment. When an instruction is disassembled, instead of 
just displaying the PC relative value, the disassembler 

to has the upper PC calculator and lower PC calculator cal- 
culate the address at which the label is located, uses 
the address to search the label table, and so displays 
1he appropriate label name. 

[0325] The debugger of the present embodiment 
15 reads instructions from the memory in units of instruc- 
tion packets that are byte-aligned, rewrites an instruc- 
tion in the instruction buffer, and writes the instructions 
back intothe memory in units of instruction packets. This 
method is suited to the debugging of instructions that 
20 are not byte-aligned. 

[0326] Note that the calculation methods used by the 
lower PC calculator and the upper PC calculator do not 
need to be the carry method described in the first em- 
bodiment, so that another method, such as a separation 
25 method, an absolute value method, or a linear method, 
can be used. 

[0327] The compiler optimization apparatus, assem- 
bler, linker, processor, disassembler, and debugger of 
the present invention have been explained by way of the 
30 first to eighth embodiments of the present invention, 
though it should be obvious that the present invention 
is not limited to these. Two example modifications are 
given below 

35 (1) In the first to sixth embodiments, the assembler 
code 302, the optimized code 304, the relocatable 
codes 306, and the object code 308 may be stored 
in a mask ROM, a semiconductor memory such as 
flash memory, a magnetic storage medium such as 
a floppy disk or a hard disk, or an optical disc such 
as a CD-ROM or DVD. 

(2) In the seventh embodiment, the assembler 
codes 2906 may be stored in a mask ROM, a sem- 
45 conductor memory such as flash memory, a mag- 
netic storage medium such as a floppy disk or a 
hard disk, or an optical disc such as a CD-ROM or 
DVD. 

so [0328] Although the present invention has been fully 
described by way of examples with reference to accom- 
panying drawings, it is to be noted that various changes 
and modifications will be apparent to those skilled in the 
art. Therefore, unless such changes and modifications 

55 depart from the scope of the present invention, they 
should be construed as being included therein. 
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1 . A processor for reading instructions from a memory 
according to a program counter, the memory storing 
instructions in one-byte units, and for executing the 
read instructions, 

the program counter including a first program 
counter and a second program counter, 
the first program counter indicating a storage 
position of a processing packet in the memory, 
the processing packet being composed of an 
integer number of the one-byte units, 
the second program counter indicating a posi- 
tion of processing target instruction in the 
processing packet, the processing target in- 
struction being an operation to be executed by 
the processor. 

2. The processor of Claim 1 , including a first program 
counter updating means and a second program 
counter updating means, 

the second program counter updating means 
incrementing a value of the second program 
counter in accordance with an amount of in- 
structions that were executed in a preceding cy- 
cle and sending any carry generated in an in- 
crementing to the first program counter updat- 
ing means, and 

the first program counter updating means add- 
ing the carry received from the second program 
counter updating means to the value of the first 
program counter. 

3. The processor of Claim 2, further including: 

program counter relative value extracting 
means for extracting, when an instruction being 
executed includes a program counter relative 
value that is based on an address of a first in- 
struction executed in a present cycle, the pro- 
gram counter relative value; and 
calculating means for adding the program 
counter relative value to the value of the first 
program counter and the value of the second 
program counter and setting an addition result 
as the value of the first program counter and 
the value of the second program counter. 

4. The processor of Claim 3, 

wherein the calculating means includes a first 
calculating unit and a second calculating unit, 
the second calculating unit adding the value of 
the second program counter and lower bits of 
the program counter relative value, setting a re- 
sult of an addition as the value of the second 



program counter, and sending any carry gener- 
ated in the addition to the first calculating unit, 
the first calculating unit adding the value of the 
first program counter, upper bits of the program 
counter relative value, and any carry received 
from the second calculating unit, and setting a 
result of an addition as the value of the first pro- 
gram counter. 

5. The processor of Claim 3, 

wherein the calculating means includes a first 
calculating unit and a second calculating unit, 
the second calculating unit adding the value of 
the second program counter and lower bits of 
the program counter relative value without gen- 
erating a carry, and setting a result of an addi- 
tion as the value of the second program coun- 
ter, 

the first calculating unit adding the value of the 
first program counter and upper bits of the pro- 
gram counter relative value, and setting a result 
of an addition as the value of the first program 
counter. 

6. The program counter of Claim 3, 

wherein the calculating means adds the value 
of the first program counter and upper bits of the 
program counter relative value, sets a result of an 
addition as the value of the first program counter, 
and sets lower bits of the program counter relative 
value as the value of the second program counter. 

7. The processor of Claim 3, 

wherein the calculating means adds the pro- 
gram counter relative value and a value whose up- 
per bits are the value of the first program counter 
and lower bits are the value of the second program 
counter, and sets upper bits of a result of an addition 
as the value of the first program counter and lower 
bits of the result as the second program counter. 

8. The processor of Claim 2, further including: 

program counter relative value extracting 
means for extracting, when an executed in- 
struction includes a program counter relative 
value that is based on an address of the exe- 
cuted instruction, the program counter relative 
value; 

program counter amending means for amend- 
ing the value of the first program counter and 
the value of the second program counter to in- 
dicate an address of the executed instruction; 
and 

calculating means for adding the program 
counter relative value, the value of the first pro- 
gram counter, and the value of the second pro- 
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gram counter, and setting a result of an addition 
as the value of the first program counter and 
the value of the second program counter. 

9. The processor of Claim 2, further including: 5 

program counter relative value calculating in- 
struction decoding means for decoding a pro- 
gram counter relative value calculating instruc- 
tion that performs an addition using a program io 
counter relative value and one of 

(a) a value of the program counter stored 
in a register, and 

(b) the value of the first program counter is 
and the value of the second program coun- 
ter; 

calculating means for performing the addition 
indicated by the program counter relative value 20 
calculating instruction to generate an addition 
result; and 

program counter value updating means for 
storing the addition result in one of 



processing target instruction in the processing 
packet; 

label detecting means 

(1) for detecting a label, which should be 
resolved by an address of a specified in- 
struction, from the instruction sequence, 
and obtaining the address of the specified 
instruction, and 

(2) for detecting a label, which should be 
resolved by a difference in addresses of 
two specified instructions, from the instruc- 
tion sequence, and obtaining the address- 
es of the two specified instructions; 

program counter relative value calculating 
means for calculating, when a label which 
should be resolved by a difference in addresses 
of two specified instructions has been detected, 
a program counter relative value by subtracting 
an address of one of the two specified instruc- 
tions from an address of another of the two 
specified instructions; 
converting means 



(a) the register, and 

(b) the first program counter and the sec- 
ond program counter. 

10. The processor of Claim 1 , 30 

wherein the first program counter indicates a 
memory address, the memory address being a stor- 
age position in the memory of a processing packet 
that is given by bit shifting the value in the first pro- 
gram counter by log 2 n bits in a leftward direction, n 35 
being a length of a processing packet in bytes. 

11. The processor of Claim 10, further including 

an instruction buffer for temporarily storing in- 40 
structions; and 

instruction reading means for transferring in- 
structions with a minimum transfer size of one 
one-byte unit from the memory to the instruc- 
tion buffer, in accordance with available space 45 
in the instruction buffer but regardless of a size 
of a processing packet. 

12. An instruction sequence optimizing apparatus, for 
generating optimized code from an instruction se- so 
quence, comprising: 

address assigning means for estimating a size 
of each instruction in the instruction sequence 
and assigning an address to each instruction, 55 
upper bits of each address indicating a memory 
address at which a processing packet is stored 
and lower bits of each address indicating a 



(1) for converting an instruction that has a 
label that should be resolved by an address 
of a specified instruction into an instruction 
with a size that is based on a size of the 
address of the specified instruction, 

(2) for converting an instruction that has a 
label that should be resolved by a differ- 
ence in addresses of two specified instruc- 
tions into an instruction with a size that is 
based on a size of the program counter rel- 
ative value calculated from the addresses 
of the two specified instructions; and 

optimized code generating means for generat- 
ing optimized code by converting addresses of 
instructions in accordance with the sizes of in- 
structions after conversion by the converting 
means. 

13. The instruction sequence optimizing apparatus of 
Claim 12, 

wherein the program counter relative value cal- 
culating means includes a lower bit subtracting 
unit and an upper bit subtracting unit, 
the lower bit subtracting unit subtracting lower 
bits of the address of the one of the two speci- 
fied instructions from lower bits of the address 
of the other of the two specified instructions, for 
setting a result of a subtraction as lower bits of 
the program counter relative value, and send- 
ing any carry generated in the subtraction to the 
upper bit subtracting unit, and 
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the upper bit subtracting unit subtracting upper 
bits of the address of one of the two specified 
instructions and any carry received from the 
lower bit subtracting unit from upper bits of the 
address of the other of the two specified in- 5 
structions, and for setting a result of a subtrac- 
tion as upper bits of the program counter rela- 
tive value. 

14. The instruction sequence optimizing apparatus of io 
Claim 12, 

wherein the program counter relative value cal- 
culating means includes a lower bit subtracting 
unit and an upper bit subtracting unit, is 
the lower bit subtracting unit subtracting lower 
bits of the address of one of the two specified 
instructions from lower bits of the address of the 
other of the two specified instructions without 
generating a carry and setting a result of a sub- 20 
traction as lower bits of the program counter rel- 
ative value, and 

the upper bit subtracting unit subtracting upper 
bits of the address of one of the two specified 
instructions from upper bits of the address of 25 
the other of the two specified instructions, and 
for setting a result of a subtraction as upper bits 
of the program counter relative value. 

15. The instruction sequence optimizing apparatus of 30 
Claim 12, 

wherein the program counter relative value 
calculating means subtracts upper bits of an ad- 
dress of one of the two specified instructions from 
upper bits of an address of the other of the two spec- 35 
ified instructions, sets a result of a subtraction as 
upper bits of the program counter relative value, 
and sets lower bits of the other of the two specified 
instructions as lower bits of the program counter rel- 
ative value. 40 

16. An assembler that generates relocatable code from 
an instruction sequence, each address of an in- 
struction in the instruction sequence having upper 
bits that indicate a memory address at which a 45 
processing packet is stored and lower bits that in- 
dicate a position of processing target instruction 
that is included in the processing packet, 

the assembler comprising: so 
label detecting means for detecting a label in 
the instruction sequence that should be re- 
solved by a difference in addresses between 
two specified instructions, and obtaining the 
addresses of the two specified instructions; 55 
program counter relative value calculating 
means for calculating a program counter rela- 
tive value by subtracting an address of one of 



the two specified instructions from an address 
of another of the two specified instructions; and 
replacing means for replacing the label with the 
program counter relative value calculated by 
the program counter relative value calculating 
means. 

17. The assembler of Claim 16, 

wherein the program counter relative value cal- 
culating means includes a lower bit subtracting 
unit and an upper bit subtracting unit, 
the lower bit subtracting unit subtracting lower 
bits of the address of the one of the two speci- 
fied instructions from lower bits of the address 
of the other of the two specified instructions, for 
setting a result of a subtraction as lower bits of 
the program counter relative value, and send- 
inq any carry generated in the subtraction to the 
upper bit subtracting unit, and 
the upper bit subtracting unit subtracting upper 
bits of the address of one of the two specified 
instructions and any carry received from the 
lower bit subtracting unit from upper bits of the 
address of the other of the two specified in- 
structions, and for setting a result of a subtrac- 
tion as upper bits of the program counter rela- 
tive value. 

18. The assembler of Claim 16, 

wherein the program counter relative value cal- 
culating means includes a lower bit subtracting 
unit and an upper bit subtracting unit, 
the lower bit subtracting unit subtracting lower 
bits of the address of one of the two specified 
instructions from lower bits of the address of the 
other of the two specified instructions without 
generating a carry and setting a result of a sub- 
traction as lower bits of the program counter rel- 
ative value, and 

the upper bit subtracting unit subtracting upper 
bits of the address of one of the two specified 
instructions from upper bits of the address of 
the other of the two specified instructions, and 
for setting a result of a subtraction as upper bits 
of the program counter relative value. 

19. The assembler of Claim 16, 

wherein the program counter relative value 
calculating means subtracts upper bits of an ad- 
dress of one of the two specified instructions from 
upper bits of an address of the other of the two spec- 
ified instructions, sets a result of a subtraction as 
upper bits of the program counter relative value, 
and sets lower bits of the other of the two specified 
instructions as lower bits of the program counter rel- 
ative value. 
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20. A linker that generates object code by combining 
relocatable code, each address of an instruction in 
the relocatable code having upper bits that indicate 
a memory address at which a processing packet is 
stored and lower bits that indicate a position of 5 
processing target instruction that is included in the 
processing packet, 

the linker comprising: 

relocation information detecting means for de- io 
tecting a label in the relocatable code that 
should be resolved by a difference in addresses 
between two specified instructions, and obtain- 
ing the addresses of the two specified instruc- 
tions; is 
program counter relative value calculating 
means for calculating a program counter rela- 
tive value by subtracting an address of one of 
the two specified instructions from an address 
of another of the two specified instructions; and 20 
replacing means for replacing the label with the 
program counter relative value calculated by 
the program counter relative value calculating 
means. 

25 

21. The linker of Claim 20, 

wherein the program counter relative value cal- 
culating means includes a lower bit subtracting 
unit and an upper bit subtracting unit, 30 
the lower bit subtracting unit subtracting lower 
bits of the address of the one of the two speci- 
fied instructions from lower bits of the address 
of the other of the two specified instructions, for 
setting a result of a subtraction as lower bits of 35 
the program counter relative value, and send- 
ing any carry generated in the subtraction to the 
upper bit subtracting unit, and 
the upper bit subtracting unit subtracting upper 
bits of the address of one of the two specified 40 
instructions and any carry received from the 
lower bit subtracting unit from upper bits of the 
address of the other of the two specified in- 
structions, and for setting a result of a subtrac- 
tion as upper bits of the program counter rela- 45 
tive value. 

22. The linker of Claim 20, 

wherein the program counter relative value cal- so 
culating means includes a lower bit subtracting 
unit and an upper bit subtracting unit, 
the lower bit subtracting unit subtracting lower 
bits of the address of one of the two specified 
instructions from lower bits of the address of the 55 
other of the two specified instructions without 
generating a carry and setting a result of a sub- 
traction as lower bits of the program counter rel- 
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ative value, and 

the upper bit subtracting unit subtracting upper 
bits of the address of one of the two specified 
instructions from upper bits of the address of 
the other of the two specified instructions, and 
for setting a result of a subtraction as upper bits 
of the program counter relative value. 

23. The linker of Claim 20, 

wherein the program counter relative value 
calculating means subtracts upper bits of an ad- 
dress of one of the two specified instructions from 
upper bits of an address of the other of the two spec- 
ified instructions, sets a result of a subtraction as 
upper bits of the program counter relative value, 
and sets lower bits of the other of the two specified 
instructions as lower bits of the program counter rel- 
ative value. 

24. A disassembler that receives an indication of an ad- 
dress of an instruction in object code and outputs 
an assembler name of the instruction at the indicat- 
ed address, each address of an instruction in the 
object code having upper bits that indicate a mem- 
ory address at which a processing packet is stored 
and lower bits that indicate a position of processing 
target instruction that is included in the processing 
packet, 

the disassembler comprising: 
program counter relative value extracting 
means for extracting, when the indicated in- 
struction includes a program counter relative 
value, the program counter relative value from 
the indicated instruction; 
label addressing calculating means for adding 
an address of the indicated instruction to the 
extracted program counter relative value and 
setting an addition result as a label address; 
storing means for storing a label name corre- 
sponding to each label address; and 
searching means for searching the storing 
means for a label name that corresponds to the 
calculated label address and outputting the cor- 
responding label name. 

25. The disassembler of Claim 24, 

wherein the label address calculating means in- 
cludes a lower bit calculating unit and an upper 
bit calculating unit, 

the lower bit calculating unit for adding lower 
bits of the address of the indicated instruction 
and lower bits of the program counter relative 
value, setting a result of an addition as lower 
bits of a label address, and sending any carry 
generated by the addition to the upper bit cal- 
culating unit, and 
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the upper bit calculating unit adding upper bits 
of the address of the indicated instruction, up- 
per bits of the program counter relative value, 
and any carry received from the lower bit cal- 
culating unit, and setting a result of the an ad- 
dition as upper bits of the label address. 

26. The disassembler of Claim 24, 

wherein the label address calculating means in- 
cludes a lower bit calculating unit and an upper 
bit calculating unit, 

the lower bit calculating unit adding lower bits 
of the address of the indicated instruction and 
lower bits of the program counter relative value 
without generating a carry, and setting a result 
of an addition as lower bits of a label address, 
and 

the upper bit calculating unit adding upper bits 
of the address of the indicated instruction and 
upper bits of the program counter relative val- 
ue, and setting a result of an addition as upper 
bits of the label address. 

27. The disassembler of Claim 24, wherein 

the label address calculating means adds up- 
per bits of the address of the indicated instruction 
and upper bits of the program counter relative val- 
ue, sets a result of an addition as upper bits of the 
label address, and sets lower bits of the program 
counter relative value as lower bits of the label ad- 
dress. 

28. A debugger that receives an indication of an ad- 
dress of an instruction in object code and replaces 
the instruction at the indicated address with a re- 
placement instruction, each address of an instruc- 
tion in the object code having upper bits that indi- 
cate a memory address at which a processing pack- 
et is stored and lower bits that indicate a position of 
processing target instruction that is included in the 
processing packet, 

the debugger comprising: 
processing packet reading means for reading 
a processing packet that is indicated by upper 
bits of the indicated address from the memory 
and writing the processing packet into an in- 
struction buffer; 

instruction writing means for writing the re- 
placement instruction into the processing pack- 
et in the instruction buffer over an instruction 
that is indicated by the lower bits of the indicat- 
ed address; and 

processing packet writing means for writing the 
processing packet in the instruction buffer back 
into the memory after the replacement instruc- 
tion has been written. 



29. A compiler that generates an instruction sequence 
from source code, 

the compiler generating a program counter rel- 
ative value calculating instruction that is exe- 
cuted by a processor, the program counter rel- 
ative value calculating instruction being an in- 
struction that performs a calculation using a 
first value and a program counter relative value 
and uses a result of the calculation to update 
the first value, the first value being one of 

(a) a value of a program counter stored in 
a register, and 

(b) the value stored in a program counter 
of the processor, 

wherein upper bits of the first value indicate a 
memory address at which a processing packet 
is stored, and lower bits of the first value of the 
program counter indicate a processing target 
instruction that is included in the processing 
packet. 

25 30. The compiler of Claim 29, 

wherein the processor includes a lower bit cal- 
culating unit and an upper bit calculating unit, 
the program counter relative value calculating 
30 instruction having the lower bit calculating unit 

perform a lower bit calculation and the upper 
bit calculating unit perform an upper bit calcu- 
lation, 

the lower bit calculation being an addition using 
lower bits of the first value and lower bits of the 
value of the program counter relative value, 
where a result of the lower bit calculation is set 
as the lower bits of the first value and any gen- 
erated carry is sent to the upper bit calculating 
unit, and 

the upper bit calculation being an addition using 
upper bits of the first value, upper bits of the 
value of the program counter relative value and 
any carry received from the lower bit calculating 
unit, where a result of the upper bit calculation 
is set as the upper bits of the first value. 

31. The compiler of Claim 29, 

wherein the processor includes a lower bit cal- 
culating unit and an upper bit calculating unit, 
the program counter relative value calculating 
instruction having the lower bit calculating unit 
perform a lower bit calculation and the upper 
bit calculating unit perform an upper bit calcu- 
lation, 

the lower bit calculation being an addition using 
lower bits of the first value and lower bits of the 
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value of the program counter relative value that 
does not generate a carry, where a result of the 
lower bit calculation is set as the lower bits of 
the first value, and 

the upper bit calculation being a calculation us- 
ing upper bits of the first value and upper bits 
of the value of the program counter relative val- 
ue, where a result of the upper bit calculation 
is set as the upper bits of the first value. 

32. The compiler of Claim 29, 

wherein the processor includes an upper bit 
calculating unit, 

the program counter relative value calculating 
instruction having the upper bit calculating unit 
perform an upper bit calculation and setting 
lower bits of the program counter relative value 
as lower bits of the first value, and 
the upper bit calculation being an addition using 
upper bits of the first value and upper bits of the 
value of the program counter relative value, 
where a result of the upper bit calculation is set 
as the upper bits of the first value. 

33. A computer-readable recording medium storing an 
instruction sequence optimizing program that gen- 
erates optimized code from an instruction se- 
quence, the instruction sequence optimizing pro- 
gram including: 

an address assigning step for estimating a size 
of each instruction in the instruction sequence 
and assigning an address to each instruction, 
upper bits of each address indicating a memory 
address at which a processing packet is stored 
and lower bits of each address indicating a 
processing target instruction in the processing 
packet; 

a label detecting step 

(1) for detecting a label, which should be 
resolved by an address of a specified in- 
struction, from the instruction sequence, 
and obtaining the address of the specified 
instruction, and 

(2) for detecting a label, which should be 
resolved by a difference in addresses of 
two specified instructions, from the instruc- 
tion sequence, and obtaining the address- 
es of the two specified instructions; 

a program counter relative value calculating 
step for calculating, when a label which should 
be resolved by a difference in addresses of two 
specified instructions has been detected, a pro- 
gram counter relative value by subtracting an 
address of one of the two specified instructions 



from an address of another of the two specified 

instructions; 

a converting step 

(1) for converting an instruction that has a 
label that should be resolved by an address 
of a specified instruction into an instruction 
with a size that is based on a size of the 
address of the specified instruction, 

(2) for converting an instruction that has a 
label that should be resolved by a differ- 
ence in addresses of two specified instruc- 
tions into an instruction with a size that is 
based on a size of the program counter rel- 
ative value calculated from the addresses 
of the two specified instructions; and 

an optimized code generating step for generat- 
ing optimized code by converting addresses of 
instructions in accordance with the sizes of in- 
structions after conversion in the converting 
step. 

34. The computer-readable recording medium of Claim 



wherein the program counter relative value cal- 
culating step includes a lower bit subtracting 
substep and an upper bit subtracting substep, 
the lower bit subtracting substep subtracting 
lower bits of the address of the one of the two 
specified instructions from lower bits of the ad- 
dress of the other of the two specified instruc- 
tions, for setting a result of a subtraction as low- 
er bits of the program counter relative value, 
and sending any carry generated in the sub- 
traction to the upper bit subtracting substep, 
and 

the upper bit subtracting substep subtracting 
upper bits of the address of one of the two spec- 
ified instructions and any carry received from 
the lower bit subtracting substep from upper 
bits of the address of the other of the two spec- 
ified instructions, and for setting a result of a 
subtraction as upper bits of the program coun- 
ter relative value. 

35. The computer-readable recording medium of Claim 



wherein the program counter relative value cal- 
culating step includes a lower bit subtracting 
substep and an upper bit subtracting substep, 
the lower bit subtracting substep subtracting 
lower bits of the address of one of the two spec- 
ified instructions from lower bits of the address 
of the other of the two specified instructions 
without generating a carry and setting a result 
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of a subtraction as lower bits of the program 
counter relative value, and 
the upper bit subtracting substep subtracting 
upper bits of the address of one of the two spec- 
ified instructions from upper bits of the address 
of the other of the two specified instructions, 
and for setting a result of a subtraction as upper 
bits of the program counter relative value. 

36. The computer-readable recording medium of Claim 

33, 

wherein the program counter relative value 
calculating step subtracts upper bits of an address 
of one of the two specified instructions from upper 
bits of an address of the other of the two specified 
instructions, sets a result of a subtraction as upper 
bits of the program counter relative value, and sets 
lower bits of the other of the two specified instruc- 
tions as lower bits of the program counter relative 
value. 

37. A computer-readable recording medium storing an 
assembler program that generates relocatable 
code from optimized code that have been generat- 
ed from an instruction sequence, each address of 
an instruction in the optimized code having upper 
bits that indicate a memory address at which a 
processing packet is stored and lower bits that in- 
dicate a position of processing target instruction 
that is included in the processing packet, 

the assembler program comprising: 
a label detecting step for detecting a label in the 
instruction sequence that should be resolved 
by a difference in addresses between two spec- 
ified instructions, and obtaining the addresses 
of the two specified instructions; 
a program counter relative value calculating 
step for calculating a program counter relative 
value by subtracting an address of one of the 
two specified instructions from an address of 
another of the two specified instructions; and 
a replacing step for replacing the label with the 
program counter relative value calculated by 
the program counter relative value calculating 
step. 

38. The computer-readable recording medium of Claim 
37, 

wherein the program counter relative value cal- 
culating step includes a lower bit subtracting 
substep and an upper bit subtracting substep, 
the lower bit subtracting substep subtracting 
lower bits of the address of the one of the two 
specified instructions from lower bits of the ad- 
dress of the other of the two specified instruc- 
tions, for setting a result of a subtraction as low- 



er bits of the program counter relative value, 
and sending any carry generated in the sub- 
traction to the upper bit subtracting substep, 
and 

5 the upper bit subtracting substep subtracting 

upper bits of the address of one of the two spec- 
ified instructions and any carry received from 
the lower bit subtracting substep from upper 
bits of the address of the other of the two spec- 
ie ified instructions, and for setting a result of a 
subtraction as upper bits of the program coun- 
ter relative value. 

39. The computer-readable recording medium of Claim 
J5 37, 

wherein the program counter relative value cal- 
culating step includes a lower bit subtracting 
substep and an upper bit subtracting substep, 
20 the lower bit subtracting substep subtracting 

lower bits of the address of one of the two spec- 
ified instructions from lower bits of the address 
of the other of the two specified instructions 
without generating a carry and setting a result 
25 of a subtraction as lower bits of the program 

counter relative value, and 
the upper bit subtracting substep subtracting 
upper bits of the address of one of the two spec- 
ified instructions from upper bits of the address 
30 of the other of the two specified instructions, 

and for setting a result of a subtraction as upper 
bits of the program counter relative value. 

40. The computer-readable recording medium of Claim 
35 37, 

wherein the program counter relative value 
calculating step subtracts upper bits of an address 
of one of the two specified instructions from upper 
bits of an address of the other of the two specified 
40 instructions, sets a result of a subtraction as upper 
bits of the program counter relative value, and sets 
lower bits of the other of the two specified instruc- 
tions as lower bits of the program counter relative 
value. 

45 

41. A computer-readable recording medium storing a 
linker program that generates object code from re- 
locatable code that has been generated from an in- 
struction sequence, each address of an instruction 

so in the optimized code having upper bits that indicate 
a memory address at which a processing packet is 
stored and lower bits that indicate a position of 
processing target instruction that is included in the 
processing packet, 

55 

the linker program comprising: 
a relocation information detecting step for de- 
tecting a label in the relocatable code that 



20 



25 



34 
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should be resolved by a difference in addresses 
between two specified instructions, and obtain- 
ing the addresses of the two specified instruc- 
tions; 

a program counter relative value calculating 5 
step for calculating a program counter relative 
value by subtracting an address of one of the 
two specified instructions from an address of 
another of the two specified instructions; and 
a replacing step for replacing the label with the 10 
program counter relative value calculated by 
the program counter relative value calculating 
step. 

42. The computer-readable recording medium of Claim is 



wherein the program counter relative value cal- 
culating step includes a lower bit subtracting 
substep and an upper bit subtracting substep, 
the lower bit subtracting substep subtracting 
lower bits of the address of the one of the two 
specified instructions from lower bits of the ad- 
dress of the other of the two specified instruc- 
tions, for setting a result of a subtraction as low- 
er bits of the program counter relative value, 
and sending any carry generated in the sub- 
traction to the upper bit subtracting substep, 
and 

the upper bit subtracting substep subtracting 
upper bits of the address of one of the two spec- 
ified instructions and any carry received from 
the lower bit subtracting substep from upper 
bits of the address of the other of the two spec- 
ified instructions, and for setting a result of a 
subtraction as upper bits of the program coun- 
ter relative value. 



44. The computer-readable recording medium of Claim 
41, 

wherein the program counter relative value 
calculating step subtracts upper bits of an address 
of one of the two specified instructions from upper 
bits of an address of the other of the two specified 
instructions, sets a result of a subtraction as upper 
bits of the program counter relative value, and sets 
lower bits of the other of the two specified instruc- 
tions as lower bits of the program counter relative 
value. 

45. A computer-readable recording medium storing a 
compiler program that generates an instruction se- 
quence from source code, 

the compiler program generating a program 
counter relative value calculating instruction 
that is executed by a processor, the program 
counter relative value calculating instruction 
being an instruction that performs a calculation 
using a first value and a program counter rela- 
tive value and uses a result of the calculation 
to update the first value, the first value being 
one of 

(a) a value of a program counter stored in 
a register and 

(b) the value stored in a program counter 
of the processor, 

wherein upper bits of the first value indicate a 
memory address at which a processing packet 
is stored, and lower bits of the first value of the 
program counter indicate a processing target 
instruction that is included in the processing 
packet. 



43. The computer-readable recording medium of Claim 46. The computer-readable recording medium of Claim 



wherein the program counter relative value cal- 
culating step includes a lower bit subtracting 
substep and an upper bit subtracting substep, 
the lower bit subtracting substep subtracting 45 
lower bits of the address of one of the two spec- 
ified instructions from lower bits of the address 
of the other of the two specified instructions 
without generating a carry and setting a result 
of a subtraction as lower bits of the program so 
counter relative value, and 
the upper bit subtracting substep subtracting 
upper bits of the address of one of the two spec- 
ified instructions from upper bits of the address 
of the other of the two specified instructions, ss 
and for setting a result of a subtraction as upper 
bits of the program counter relative value. 



wherein the processor includes a lower bit cal- 
culating unit and an upper bit calculating unit, 
the program counter relative value calculating 
instruction having the lower bit calculating unit 
perform a lower bit calculation and the upper 
bit calculating unit perform an upper bit calcu- 
lation, 

the lower bit calculation being an addition using 
lower bits of the first value and lower bits of the 
value of the program counter relative value, 
where a result of the lower bit calculation is set 
as the lower bits of the first value and any gen- 
erated carry is sent to the upper bit calculating 
unit, and 

the upper bit calculation being an addition using 
upper bits of the first value, upper bits of the 
value of the program counter relative value and 
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any carry received from the lower bit calculating 
unit, where a result of the upper bit calculation 
is set as the upper bits of the first value. 

47. The computer-readable recording medium of Claim s 
45, 

wherein the processor includes a lower bit cal- 
culating unit and an upper bit calculating unit, 
the program counter relative value calculating io 
instruction having the lower bit calculating unit 
perform a lower bit calculation and the upper 
bit calculating unit perform an upper bit calcu- 
lation, 

the lower bit calculation being an addition using is 
lower bits of the first value and lower bits of the 
value of the program counter relative value that 
does not generate a carry, where a result of the 
lower bit calculation is set as the lower bits of 
the first value, and 20 
the upper bit calculation being a calculation us- 
ing upper bits of the first value and upper bits 
of the value of the program counter relative val- 
ue, where a result of the upper bit calculation 
is set as the upper bits of the first value. 25 

48. The computer-readable recording medium of Claim 
45, 

wherein the processor includes an upper bit 30 
calculating unit, 

the program counter relative value calculating 
instruction having the upper bit calculating unit 
perform an upper bit calculation and setting 
lower bits of the program counter relative value 35 
as lower bits of the first value, and 
the upper bit calculation being an addition using 
upper bits of the first value and upper bits of the 
value of the program counter relative value, 
where a result of the upper bit calculation is set 40 
as the upper bits of the first value. 
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DISPLAYED 1 




ASSEMBLER INSTRUCTION ANDl 
LABEL NAME DISPLAYED | 






1 


™" ^ 



END 
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FIG. 43 
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INSTRUCTION ADDRESS AND f 
REPLACEMENT INSTRUCTION INPUTTED I 



^S4201 



PACKET ADDRESS SPECIFIED, INSTRUCTION | 
PACKET STORED IN INSTR UCT I ON BUFF ER^ | 



A 
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jlN-PACKET ADDRESS SPECIFIED] 
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INSTRUCTION IN FIRST 
UNIT REPLACED WITH 
REPLACEMENT 
INSTRUCTION 



Yes 
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INSTRUCTION IN THIRD UNIT 
REPLACED WITH REPLACEMENT 
INSTRUCTION 



INSTRUCTION IN 
SECOND UNIT 
REPLACED WITH 
REPLACEMENT 
INSTRUCTION 



..S4208 



INSTRUCTION PACKET IN THE INSTRUCTION 
MEMORY REPLACED WITH INSTRUCTION 
PACKET IN THE INSTRUCTION BUFFER 



END 



76 



